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SUBJECT INDEX 


PREFACE 


The object of this book is to introduce the three main uses of 
group theory in quantum mechanics, which are: firstly, to label 
energy levels and the corresponding eigenstates; secondly, to discuss 
qualitatively the splitting of energy levels as one starts from an 
approximate Hamiltonian and adds correction terms; and thirdly, 
to aid in the evaluation of matrix elements of all kinds, and in 
particular to provide general selection rules for the non-zero ones. 
The theme is to show how all this is achieved by considering the 
symmetry properties of the Hamiltonian and the way in which 
these symmetries are reflected in the wave functions. In Chapter I 
the necessary mathematical concepts are introduced in as elemen- 
tary and illustrative a manner as possible, with the proofs of some of 
the fundamental theorems being relegated to an appendix. The 
three uses of group theory above are illustrated in detail in Chapter 
II by a fairly quick run through the theory of atomic energy levels 
and transitions. This topic is particularly suitable for illustrative 
purposes, because most of the results are familiar from the usual 
vector model of the atom but are derived here in a rigorous and 
precise way. Also most of it, e.g. the introduction of spin functions 
and the exclusion principle, is fundamental to all the later more 
advanced topics. Chapter III is a repository for the theory of group 
characters, the crystallographic point-groups and minor points 
required in some of the later applications. Thus, after selected 
readings from chapter III according to his field of interest, the 
reader is ready to jump immediately to any of the applications of 
the theory covered in later chapters, namely: further topics in the 
theory of atomic energy levels (Chapter IV), the electronic structure 
and vibrations of molecules (Chapter V), solid state physics (Chap- 
ter VI), nuclear physics (Chapter VII), and relativistic quantum 
mechanics (Chapter VITI). 

The level of the text is that of a course for research students in 
physics and chemistry, such as is now offered in many Universities. 
A previous course in quantum theory, based on a text such as 
Schiff Quantum Mechanics, is. assumed, but the matrix algebra 
required is included as an appendix. In selecting the material for 
the applications in various branches of physics and chemistry in 
Chapters IV to VIII, I have restricted myself as far as possible to 
topics satisfying three criteria: (i) the topics should be simple 
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applications that illustrate basic principles, rather than complicated 
examples designed to overawe the reader with the power of group 
theory; (ii) the material should be intrinsically interesting and of 
the sort that is suitable for inclusion in a general course of advanced 
quantum mechanics; and (iii) topics must not involve too much 
specialized background knowledge of particular branches of physics. 
The view adopted throughout is that group theory is not just a 
specialized tool for solving a few of the more difficult and intricate 
problems in quantum theory. In advanced quantum mechanics 
practically all general statements that can be made about a com- 
plicated system depend on its symmetry properties, and the use of 
group representations is just a systematic, unified way of thinking 
about and exploiting these symmetries. For this reason I have not 
hesitated to include simple results for which one could easily produce 
ad hoc proofs from first principles: indeed, it must always remain 
true that the use of group theory could be circumvented by detailed 
algebraic considerations on almost all occasions. However, the 
author is convinced that the essential ideas of group theory are 
sufficiently simple to make the time spent on acquiring this way of 
thinking well worth while. 

A series of examples is appended to each section. Some of these 
are simple drill in the concepts introduced in the section; others, 
particularly in later chapters, indicate extensions of the theory and 
further applications. Those marked with an asterisk are more 
difficult or require additional reading, and are often suitable as 
topics for review essays (alias term papers). 

With the three criteria for selection mentioned above, it has 
of course been quite impossible to do real justice to any of the 
applications to various branches of physics and chemistry that are 
touched on in Chapters IV to VIII. This appears to me unavoidable 
because of the amount of background knowledge required for many 
applications. It merely highlights the fact that in each of these 
specialized subjects there is a need for a monograph which uses 
group theory from the beginning as naturally and as freely as the 
Schrédinger equation itself. In this field the chemists have already 
led the way,t and the author hopes that the present book may 
hasten the day when the same applies in physics by providing a 
convenient basic reference text. 

It is a pleasure to acknowledge my indebtedness to Professor 
B. L. Van der Waerden whose elegant book first inspired my interest 
in this subject. Also I am very grateful to Dr. S. F. Boys, 


+ See Eyring, Walter and Kimball (1944) Quantum Chemistry; and Wilson, 
Decius and Cross (1955) Molecular Vibrations. 
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Dr. G. Chew, Dr. R. Karplus, Dr. M. A. Ruderman, Dr. M. Tinkham 
and Mr. D. Twose, who have either patiently helped me to understand 
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NOTATION 


Note: e is taken as the charge on the proton: all angular momentum 
operators such as L = (Lz, Ly, Lz) have the dimensions of angular 
momentum and thus contain a factor % (except in § 18), whereas 
the quantum numbers L, M_, etc., are of course pure numbers. 


Chapter I 
SYMMETRY TRANSFORMATIONS IN 
QUANTUM MECHANICS 


1. The Uses of Symmetry Properties 


Although this book has been titled “Introduction to the Present 
Use of Group Theory in Quantum Mechanics” in accordance with 
customary usage, a rather more descriptive title would have been 
“The Consequences of Symmetry in Quantum Mechanics”. The fact 
that these symmetry properties form what mathematicians have 
termed “groups” is really incidental from a physicist’s point of view, 
though it is vital to the mathematical form of the theory. It is in 
fact the symmetries of quantum mechanical systems that we shall 
be interested in. 

The following three simple examples illustrate in a preliminary 
way what is meant by symmetry properties and what their main 
consequences are. 

(i) It can be shown that the wave functions (r,, r,) (without 
spin) of a helium atom are of two types, symmetric and anti- 
symmetric, according to whether 


Pry, 02) = (Le, F,) or Y(T1, P2) = —(2, 1), 


where r, and r, are the position vectors of the two electrons (Schiff 
1955, p. 234). The, corresponding states of the atom are also referred 
to as symmetric and anti-symmetric. Thus the eigenfunctions 
turn out to have well defined symmetry properties which can there- 
fore be used in classifying and distinguishing all the different 
eigenstates. 

(ii) There are three 2p wave functions for a hydrogen atom, 


$(2p2) = af(r), $(2py) = afr), —- P(2pz) = 2f(r), (1-1) 
where f(r) is a particular function of r = |r| only (Schiff 1955, 
p. 85). Now in a free atom there are no special directions and we 
can choose and label the z-, y- and z-axes as we please, so that the 
three functions (1.1) must all correspond to the same energy level. 
If, however, we apply a magnetic field in some particular direction, 
the argument no longer holds, so that we may expect the energy 
level to be split into several different levels, up to three in number. 
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In this kind of way the symmetry properties of the eigenfunctions 
can determine the degeneracy of an energy level, and how such a 
degenerate level may split as a result of some additional perturbation. 
(iii) The probability that the outer electron of a sodium atom 
jumps from the state 4, to the state y, with the emission of radiation 
polarized in the x-direction is proportional to the square of 


m= |[ i) | Ya* @ py de dy dz (1.2) 
(Schiff 1955, p. 253). If the two states are the 4s and 3s ones, , 
and 7, are functions of r only. To calculate M in this case, we 
make the change of variable z’ = —z in (1.2) and obtain M = —WM, 
ie. M(4s, 3s) = 0. This transition probability is therefore deter- 
mined purely by symmetry. The situation is rather different when 
the transition probability is not zero. Suppose #, and y%, are the 
4p, and 3s wave functions zf,(r) and f,(r). Then (1.2) becomes 


oOo 


M(4pz, 38) = f ff fik(r) 2* falr) dx dy de. (1.3) 


By making the change of variable x’ = y, y’ = x, the x? in (1.3) 
can be replaced by y? or similarly by z?. Thus by addition 
eo ao a 
M(4pz, 38) =4 { [ [fter) Phlr)dedy de (1.4) 
—-o eo 


—-oO — 


Similarly the probabilities for all possible transitions from any 4p 
state to the 3s state or vice versa, with the emission or absorption 
of radiation, polarized circularly or linearly in any direction, can be 
reduced to the integral occurring in (1.4), the simple numerical factor 
in front being determined purely vy the particular direction and 
4p state chosen. Symmetry properties thus establish the relative 
magnitudes of several matrix elements of the form (1.2), their 
absolute values being then determined by the value of one integral. 
This type of argument explains why the intensities of the various 
components of a composite spectral line are often observed to bear 
simple ratios to one another. 

These examples serve to illustrate what is generally true, namely 
that symmetry properties allow us to classify and label the eigen- 
states of a quantum mechanical system. They enable us to discuss 
qualitatively what splittings we may expect in a degenerate energy 
level under some perturbation. They help in calculating transition 
probabilities and other matrix elements, and, in particular, in setting 
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up selection rules stating when these quantities are zero. In the 
following sections we shall develop these kinds of symmetry argu- 
ment in a systematic fashion, and shall see how they can be used 
for the above three purposes in situations that are less elementary 
than the examples given above. 

The real importance of symmetry arguments in such situations 
lies in the fact that for systems of interest the Schrodinger equation 
is usually too complicated to be solved analytically or even numeri- 
cally without making gross approximations. For instance, for an 
atom with m electrons the equation contains 4n variables (including 
spin) which are not separable. However, the symmetry properties 
of the equation may be relatively simple, so that symmetry argu- 
ments can easily be applied to the problem. Another important 
point about symmetry arguments is that they are based on the 
symmetry of the Schrédinger equation itself, so that they do not 
involve approximations, in particular those used to obtain approxi- 
mate eigenfunctions of the equation. In fact the beauty of the 
method lies in the fact that, for instance, an n electron problem can 
often be treated as simply and as rigorously as a one electron prob- 
lem. At the present time the most spectacular illustrations of these 
two aspects of symmetry arguments occur in nuclear and funda- 
mental particle physics. The shell-model theory of the energy 
levels of nuclei has been developed, with selection rules for various 
transitions, etc., all without an exact knowledge of the interaction 
between two nucleons. Similarly it is possible to discuss tentatively 
the relationships between the various fundamental particles and 
give selection rules for transitions between them, which are based 
purely on symmetry ideas, such as spin, charge conjugation, isotopic 
spin and parity, without the slightest understanding of the field 
equations describing the interactions of all these particles. ’ 


2. Expressing Symmetry Operations Mathematically 


Many of the symmetry properties that we shall be concerned with 
involve rotations, so that we shall start by considering how a 
physical operation such as rotating a system is expressed mathe- 
matically. 

Consider a body with a point P on it which has co-ordinates 
(x, y, 2). If we rotate the body clockwise by an angle « (Fig. 1), 
i.e. we rotate by — « about the z-axis in the conventional sense, the 
point P moves to the position P’(X, Y, Z), where 


OA = OC = OBcos« — BP’ sine 
AP =CP’ = OBsin« + BP’ cos «, (2.1) 
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Le. 


x= X cosa — Y sina, 


y= Xsina + Y cosa, (2.2) 
a= 2; 


P (ay,2) 


P'(X,%2Z) 


~ 
“a 


‘ 
x 
C4 n 


Fig. 1. Rotation of a point P to P’. 


Here and elsewhere the x-, y- and z-axes are chosen to form a right- 
handed set. However, instead of rotating P, we can also consider 
the body and P as fixed, and refer all co-ordinates to a new pair of 
axes OX and OY which make an angle + « with Ox and Oy (Fig. 2). 


Fie. 2. Rotation of axes. 


We have analogously to (2.1) 


OE = OD cos «a — DP sin « 
EP =OD sina + DP cos «, 


so that the co-ordinates (X, Y, Z) of P referred to the new axes 
are related to (x, y, z) again by (2.2). 

Thus the single transformation (2.2) can represent either the change 
in the co-ordinates of a point when we rotate a body by an angle 
—a, or the change in the co-ordinates of a fixed point when we 
rotate the co-ordinate axes by an angle +a. The close relationship 
between these two operations is directly evident from the similarity 
between Figs. 1 and 2. The two different points of view also arise 
when considering the symmetry properties of a physical system. 
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Consider for instance a perfectly round plate without any markings 
on it: we say it is symmetrical about a vertical axis through its 
centre, say the z-axis. We can express this more precisely by saying 
that if we rotate the plate about its axis, we cannot tell that we 
have rotated it because it is completely round with no markings on 
it. On the other hand we could also say that for a fixed position of 
the plate, the various physical properties such as moments of 
inertia associated with the z- and y-axes must be the same, no 
matter in what directions these axes are chosen. In this example 
the first approach is perhaps more natural, but when discussing the 
symmetry of the Schrédinger equation for a physical system we 
shall adopt the second point of view. Anticipating a little, we shall 
be considering a given equation and the forms it takes when expressed 
in terms of different variables like x, y, z and X, Y, Z which cor- 
respond to using different co-ordinate axes. There are two reasons 
for this choice. Firstly, the Schrédinger equation is a mathematical 
relation and not like a plate so that we cannot rotate it in quite 
the same sense, though we could, of course, write down the equation 
for the rotated physical system. Expressing an equation in terms of 
different sets of co-ordinates is a more familiar concept. Secondly, 
we shall be considering some transformations of co-ordinates that 
have no simple physical analogue. For instance, we can carry out a 
rotational transformation of spin co-ordinates without altering the 
position vectors r; of the electrons in an atom, but what does it 
mean physically to rotate an atom in spin space while holding it 
fixed in ordinary space? Nevertheless the transformations of 
co-ordinates which we shall apply to the Schrédinger equation will 
usually be suggested by and linked with the physical symmetry of 
the system in an obvious way. 

When discussing linear transformations of co-ordinates, it is 
convenient to refer to them by a single symbol such as T. For 
instance, we shall call the transformation (2.2) the transformation R, 
or because it corresponds to a rotation, the rotation R. If it is neces- 
sary to be specific about the angle of rotation, we shall call (2.2) 
the rotation R(«, z) of +a about the z-axis because this sign cor- 
responds to the change-of-axes point of view which we are adopting. 
We have already discussed in connection with Fig. 2 the effect of 
applying a transformation such as # on the co-ordinates of a point, 
and we shall now make the following preliminary definition of what 
it means to apply R(«, z) to a function of x, y, z. In § 5, reasons will 
appear for replacing this definition by a slightly enlarged concept. 
Applying the transformation R(«, z) (2.2) to a function f(x, y, 2) 
means to substitute the expressions (2.2) for x, y, z in the function 


6 GROUP THEORY IN QUANTUM MECHANIOS 


and thus express f in terms of X, Y, Z. This results in a function of 
X, Y, Z which in general displays a different functional form from 
f(x, y, 2). For instance applying R(«, z) to the function (x — y)?, 
we obtain. 
(« — y)? = [(X cos « — ¥ sin «) — (X sin « + Y cos «)]? 

= [X(cos « — sin «) — Y(cos « + sin «)]?, (2.3) 
which is a different function of X, Y, Z. Similarly we can apply 
a transformation to each side of an equation. For instance the 
equation 


: | 
aoe — y)*? = Ae — y) (2.4) 


becomest 


7) 
(cos o - — sin a 57) [loos « — sin a) — Y(cos « + sin «)}? 


= 2[X(cos a — sin «) — Y(cos « ++ sin «)], (2.5) 


which is still a correct equation as can easily be verified. 


PROBLEMS 


2.1 Apply the transformation R(«, z) (equation (2.2)) to each 
of the following functions: (a) exp x; (b) (x + ty)*; (c) x? + y? + 2°; 
(a) af (r), uf (7), 2f(")- 

2.2 Write down the linear transformation that corresponds to a 
rotation of « about the y-axis, and apply it to each of the functions 
of problem 2.1. 

2.3 The Schrédinger equation for a simple harmonic oscillator 
of frequency w is 

i? d® 22 E. 
(-5 at + 4mw*x Je = Hyp(z), 


where (x) is an eigenfunction belonging to the energy value £. 
By operating on the equation with the transformation 7 = —X, 
show that ys(—x) is also an eigenfunction belonging to the same energy 
level and so are (x) + %(—a) and (x) — ¥(—2). 


3. Symmetry Transformations of the Hamiltonian 


We shall now apply linear transformations like R(«, z) (2.2) to 
the time-independent Schrédinger equation 
Hh = Ef, (3.1) 


+ How to transform @/8z is shown in any elementary calculus text. 
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where # is the Hamiltonian operator and E the energy value 
belonging to the eigenfunction ¥. It is convenient to consider 
first the effect of a transformation on the Hamiltonian #. 

The Hamiltonian for an atom with n electrons, considering the 
nucleus as fixed and omitting spin dependent terms, is (Schiff 
1955, p. 284) 


R2 n n ne2 n n 
rae he Bie Soe 
= aL Ep 
+ a t<j 
where m is the mass of an electron, e the charge on a proton, and 
a oe  @& 
Berd By * Bae 


reg? = (x — 2y)® + (ys — ys)? + (Ce — %)?. (3.4a) 
If we apply the transformation R(«, z) (2.2) to the co-ordinates 
(24, yt, 21) of each of the x electrons, we have 
rg? = ay? + ys? + 24? 
= (X; cos a — Y;sin «)? + (X;sin « + Yj cos a)? + Z,? 
= X;2 + Yi? + Z;?. 
Similarly 
ray? = (Xe — Xj)? + (Ye — Ys)? + (Zi — Zy)*, (3-4) 
and it can easily be shown thatt 
o2 oe? o2 o2 o2 oe? 
Gea Gye t Get — Xe T OVE BBE 


e2 
= 3.2 
ry ee) 


V2 = (3.3) 


(3.5) 


Thus substituting these relations into (3.2), we see that the Hamil- 
tonian has precisely the same form when expressed in terms of the 
(Xi, Yi, Z:) co-ordinates as in terms of the (i, yi, 2i) co-ordinates, 
i.e. 


KH (xi, Yt» 21) = H (Xi, Yi, Zi). (3.6) 


This is expressed by saying that the transformation R(a, z) leaves 
(3.2) unchanged, or R(«, z) leaves H invariant, or # is invariant 
under R(a, z), or R(a, z) is a symmetry transformation of #. A 
symmetry transformation of a Hamiltonian is defined as a linear 

+ The transformation of differential operators is discussed in any elementary 
calculus text. 
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transformation of co-ordinates which leaves that Hamiltonian invariant 
in the sense of equation (3.6). 

The reason for applying linear transformations like R(a, z) (2.2) 
to a Hamiltonian now becomes a little clearer. We have seen that 
R(«, z) leaves the Hamiltonian (3.2) invariant. However, R(«, z) 
applied to the eigenfunctions of the Hamiltonian does not in general 
leave them invariant. Consider for instance the 2p wave functions 
for a hydrogen atom (example (ii) of §1). R(a, z) applied to 2f(r) 
gives (X cos «— Y sin «) f(R) which has a different functional 
form. In particular for « = 90° we obtain — Yf(R) so that R(a, z) 
has changed one eigenfunction into another. More generally con- 
sider a Schrédinger equation 


H (ary yrs 24) Yale, Ys 24) = Edy(xi, ys 4). (8.7) 
Applying any symmetry transformation 7' we obtain 
H (Xi, Y;, Zi) Po(Xi, Yi, Zi) = Ep.(Xi, Yi, Zi), (3.8) 


where y in general has a different functional form from y,. Thus 
(Xi, Ya, Zi) is an eigenfunction of #(X;, Yi, Z;), but since 
H(Xi, Yi, Zi) and H (x1, yi, zi) have the same form, we can also say 
from (3.8) that (2, yi, 24) ts an eigenfunction of H’(xi, ys, 2%) and 
belongs to the same eigenvalue EF as %,. An alternative method of 
wording this argument is to say that since (3.8) is a differential equa- 
tion in terms of the variables X;, Yi, Z:, we can replace X4, Yi, Z; 
by 2%, yz, % or any other set of symbols throughout without upsetting 
the validity of the equation. Thus (3.8) becomes 


H (x4, Ys 24) he (x4, Yi 24) i Eb,(x, Yi Zi), (3.9) 


which is just our previous conclusion expressed in symbols. Thus 
we see that the symmetry transformations of a Hamiltonian can be 
used to relate the different eigenfunctions of one energy level to one 
another and hence to label them and to discuss the degree of degen- 
eracy of the energy level. Before we can pursue this further (§ 6), we 
must discuss in greater detail the symmetry transformations of 
Hamiltonians (§§ 3 and 4) and their effect on wave functions (§ 5). 

The Hamiltonian (3.2) has two other types of symmetry trans- 
formation besides the rotation R. The transformation 


(x, Yn Z) = (X3, Y;, Z,) 
(Xe, Yo Z2) a (Xi, Y,, Z;) 
(a4, Ys, 21) = (Xi, Yi, Zi), 


(3.10) 
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is called the interchange or permutation of the co-ordinates 1 and 2, 
and is a symmetry transformation of (3.2) as is obvious by inspection. 
Similarly any permutation of the co-ordinates x;, yi, 21, 1 = 1 to n, 
is a symmetry transformation. The other symmetry transformation 
is the inversion transformation IT 


Tl: w@=—-Xi, y=—Yi, u=—Z for alli. (3.11) 


This can be combined with the rotations. An ordinary rotation such 
as (2.2) is called a proper rotation, and the combination of a proper 
rotation with the inversion I] is called an improper rotation. As a 
particular example of an improper rotation, we have JIR(180°, x) 
which is just the reflection mz in the mirror plane x = 0, i.e. 


Maz: (4, Yt, 21) = (—Xi, Yi, Z) for all 7. (3.12) 


It can easily be verified that all improper rotations, as well as 
proper ones, leave the Hamiltonian (3.2) invariant. However, 
there are many simple and important transformations that are not 
symmetry transformations of (3.2), for instance the transformation 
to cylindrical polar co-ordinates 


x = R; cos O;, yi = Ri sin ©, “= Z. (3.13) 


This transformation is in any case not a linear one because it involves 
products of R; with trigonometric functions of @;. Also V;? becomes 
1 @ 7) 1 2 a 

R; aR; (2 om) + Re 068 + an” eine) 
which is not identical in form with (3.3), so that (3.13) is not a 
symmetry transformation. Of course we may wish to express the 
Hamiltonian (3.2) in terms of cylindrical polar co-ordinates for some 
problem, but in the future we shall refer to such a transformation 
as a change to polar co-ordinates, so as to avoid confusion with 
symmetry transformations which we will be considering so much 
that it will be convenient to refer to the latter simply as trans- 
formations. | 
We must now indicate briefly what the symmetry transforma- 
tions are for the Hamiltonians of physical systems besides free atoms 
and ions which we have been considering so far. An atom has com- 
plete spherical symmetry, i.e. it is invariant to any rotation about 
any axis (cf. problem 3.7), so that it has a higher degree of symmetry 
than molecules and crystal lattices which are usually only invariant 
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to certain rotations about certain axes (cf. problems 3.4 and 3.5). 
Thus the latter have some of the symmetry transformations of the 
atom, but not any radically new ones except for the translational 
symmetry of a crystal lattice. We have therefore already mentioned 
in connection with (3.2) almost all the types of symmetry trans- 
formation which we shall discuss. 

To sum up, the form of a Hamiltonian remains unchanged by 
certain linear transformations which are called symmetry trans- 
formations of the Hamiltonian. Symmetry transformations in 
general change the eigenfunctions of one energy level into one 
another. 


PROBLEMS 


3.1 Show that the following co-ordinate changes are not sym- 
metry transformations of the Hamiltonian (3.2). 


(a) (xi, Ys, 21) = (2X1, 2%4,2%;), t=l ton. 
(b) (ty, YA: %) = (—X,, —Y, —4,), 
(x4, Yt, 2) = (Xa, Va, Z), t= 2 ton. 
(c) a = exp X;, y; = exp Yi, 2 =exp Z, *#=Il1ton. 
(d) 2, ¥;, 2; given in terms of X,, Y,, Z, by equation (2.2), 
(x4, Yt, Zt) = (Xa, Vs, Ze), 4=2 ton. 
(e) 2; = Ri sin O; cos D;, y; = RK, sin O; sin D;, 
2 = FR; cos O;, 4=I1ton. 


3.2 Express the Hamiltonian (3.2) in terms of spherical polar 
co-ordinates r, 0, 6, where 


x =rsin 6 cos ¢, y =rsin 6 sin d, z=r cos 8, 


(Schiff 1955, p. 69). 
Show that the inversion transformation JJ takes the form 


ro Ri, =—7— &, di=r+h, 4=l1ton, 


and express also the rotation (2.2) and the other symmetry trans- 
formations mentioned in § 3 in polar co-ordinates. Hence, verify 
that they again leave the Hamiltonian (3.2) invariant. 

3.3 Write down the Hamiltonian without spin dependent terms 
for an ion of nuclear charge Z with n (not equal to Z) electrons, and 
show that it has the same symmetry transformations as the Hamil- 
tonian (3.2). Do the same for the one-electron Hartree equation 
(Schiff 1955, p. 284) for the single valence electron of a sodium atom. 

3.4 Write down the Hamiltonian without spin dependent 
terms for the two electrons in a hydrogen molecule, considering the 
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two protons as fixed at the points + (0, 0, a). Show that it is 
(a) invariant under any rotation about the z-axis, but only to 180° 
rotations about the x- or y-axes, (b) invariant under reflections in 
the plane through the origin perpendicular to the z-axis and in any 
plane containing the z-axis, (c) invariant under the inversion [J 
and the reflection in the z-axis 


(x4, YM, 24) —— (—Xi, — Yi, Zi) ‘= 1, 2, 


and (d) invariant under the interchange of co-ordinates 1 and 2. 

3.5 In problem 3.4, assume that one of the protons has been 
replaced by a deuteron, and suppose that the deuteron has a slightly 
different charge from that of the proton. What effect does this have 
on the symmetry properties of the Hamiltonian? Although in 
reality the deuteron and proton have the same charge, they do have 
different masses and magnetic moments, and this would affect the 
symmetry of the problem in a similar way to the fictitious difference 
in charge if the interaction with the nuclear moments were included 
in the Hamiltonian. 

3.6 Repeat the discussion of problem 3.4 in terms of spherical 
polar co-ordinates and in terms of cylindrical polar co-ordinates 
(equation (3.13)). Which set of co-ordinates do you think is most 
convenient for this problem? 

3.7 A rotation about the origin can be defined mathematically 
as a linear transformation of co-ordinates that leaves invariant the 
distance of an arbitrary point (2, y, z) from the origin. Using this 
definition, show that the Hamiltonian (3.2) is invariant under any 
rotation about any axis. Show that the definition includes the 
improper as well as the proper rotations (Margenau and Murphy 
1943, p. 310). 

3.8 Show that an improper rotation of 180° about any axis is 
the same as a reflection in the plane through the origin perpendicular 
to that axis. 

3.9 Write down the Hamiltonian for a hydrogen atom in small 
uniform electric and magnetic fields parallel to the z-axis (Schiff 
1955, pp. 158, 292) omitting spin dependent terms and considering 
the nucleus as a fixed charge. Also assume that the 2p eigenfunctions 

x +4 x—t 
A=, bee, =o, 


where f(r) is given by Schiff (1955, p. 85), are still eigenfunctions 
to a first approximation in the presence of the fields. Prove (a) the 


2p level is three-fold degenerate in the absence of the external fields, 
(b) in the presence of the electric field only, , and %, are degenerate 
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with one another but need ‘not be degenerate with y5, (c) in the 
presence of the magnetic field only, symmetry arguments do not 
require any of the functions y,, ¥, and , to have the same energy 
so that the 2p level may be split into three levels. Hint: in each of 
the cases (a), (b) and (c) test whether the reflection in the plane 
y = 0 and the rotation of 90° about the y-axis are symmetry trans- 
formations. If they are, use them to apply the argument of equa- 
tions (3.7), (3.8), (3.9) to each of the functions y,, %2, #3. Try also 
the inversion JJ, rotations about other axes and other reflections to 
ensure as far as possible that no degeneracy required by symmetry 
has been missed. 


4. Groups of Symmetry Transformations 


In this section we shall illustrate and define what is meant by 
a group in the mathematical sense of the word, and shall show what 
relevance this concept has to the symmetry transformations of 
Hamiltonians. 


Example of a group 

Let us first consider the symmetry properties of a particular 
physical object, namely an equilateral triangle cut out of a piece 
of cardboard having the same finish on both sides and lying on the 
table with its vertices at the points 1, 2 and 3 and its centre at the 
origin of co-ordinates (Fig. 3). Ok, Ol, Om are three other axes 


Fia. 3. Axes for an equilateral triangle. 


perpendicular to the sides, Ok being identical with the negative 
y-axis. A rotation A of 120° about the z-axis moves the vertex 
that was at the point 1 to the point 2, etc., and we shall call this an 
equivalent position of the triangle since it is indistinguishable from 
the original’ position. It can easily be seen that the following 
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rotations all leave the triangle in equivalent positions and that 
there are no other proper rotations that do this. 


: 120° about z-axis 

: 240° or —120° about z-axis 

: 180° about Ok axis 

: 180° about Ol axis 

: 180° about Om axis 

: 0° or 360° about any axis, i.e. no rotation. (4.1) 


by Be by ty be 


If we apply two rotations successively, for instance first A and then 
K, this moves the top vertex from position | first to position 2 and 
then to 3, the vertex at the position 2 to 3 and then to 2, the vertex 
at 3 to 1 and then to 1. Thus the combined operation A followed by 
K is identical with the single rotation L. Similarly K followed by A is 
the same as M, and it can easily be verified that combining any pair 
of the rotations (4.1) in either order gives another rotation which 
is also one of the ones listed in (4.1). If the rotation F applied 
First followed by S applied Second is equivalent to the single 
Combined rotation C, we write 


SF =C, (4.2) 


where it is customary to write the S before the F in analogy with 
differential operators. For instance 


d 
x aad (*) (4.3) 


means first differentiating f(x) and then multiplying the result by 
x*, This is clearly not the same as 


d 
ae xf (x), (4.4) 


and similarly when combining rotations it is important to follow 
the convention of (4.2). We have already seen that 


KA=L, AK=M, (4.5) 


and similarly it is possible to write down a whole multiplication 
table (Table 1) where the rotation in the top row is applied first and 
the rotation in the left column second. There is an important feature 
of Table 1, namely that for every rotation P, there is also a rotation 
P-1, say, which undoes the effect of P, and that P also undoes the 
effect of P-}, ie. 

PP- = P-1P = E. (4.6) 


In fact in every case P and P-' are just two rotations by the same 


14 GROUP THEORY IN QUANTUM MECHANICS 


angle about the same axis but in opposite directions. When the 
angle is 180° this of course makes P and P~' identical. It can also 
be verified from the multiplication table that the triple products 


TABLE | 
Multiplication Table for the Group 32 


E A B K L M applied first 
E E A B K L M 
A A B £E M K L 
B B E A L M K 
K K L M E A B 
L L M K B E A 
M M K L A B E 
applied 
second 


P(QR) and (PQ)R are always the same, so that they can be written 
unambiguously as PQR. Alternatively this follows directly from 
the physical nature of rotations as can easily be shown. These 
properties suffice to establish that the rotations H, A, B, K, L, M 
(4.1) are the elements of a group. 

Definition of a group. A group © is a collection of elements 
A, BOG, Deis oe which have the properties (a) to (e) below. The 
elements in the simplest cases may be numbers. They may also 
be any other quantities such as matrices, physical operations like 
rotations, or mathematical operations such as making a linear 
transformation of co-ordinates. 

(a) It must be possible to combine any pair of elements F and 8 
in a definite way to form a combination C which we shall write 


where as before F is the first element, S the second element and C 
the combination, if the order of F and S is important. In our 
example with the elements (4.1), the law of combination was “‘first 
apply rotation F and then S’’. With other groups the law of com- 
bination may be matrix multiplication or like addition. If for two 
elements PQ = QP then P and Q are said to commute, and if this 
is so for every pair of elements then the law of combination is 
commutative and the group is Abelian. 

(b) The combination C = SF of any pair of elements F and S 
must also be an element of the group. Thus a multiplication table 
among the group elements can always be set up like Table I. 
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(c) One of the group elements, E say, must have the properties of a 
unit element, namely 
EP = PE=P (4.8) 


for every element P. For instance omitting all reference to H would 
make it impossible to set up a complete multiplication table for 
the other rotations of (4.1) (cf. Table 1). This is related to the next 
property. 

(d) Every element P of the group must have an inverse P-! which 
also belongs to & with the property . 


PP1=P-P=E. (4.9) 
(e) The triple product PQR must be uniquely defined, 1.e. 
P(QR) = (PQ) R = PQR. (4.10) 


This is true for all the kinds of elements and laws of combination 
that we shall wish to deal with, but there are examples where it 
does not hold, e.g. 24 + (6 + 2) 4 (24 + 6) = 2! 

Two simple examples of groups are all positive rational fractions 
excluding zero with the law of combination being multiplication, and 
all positive and negative integers including zéro with the law of 
combination being addition. In the latter case it is interesting that 
zero plays the role of the unit element Z. The permutations of 
objects, i.e. the operations of rearranging them and not their different 
arrangements in a row, say, form the permutation group Pn also 
known as the symmetric group S,. The proper rotations by all 
possible angles about a fixed axis form the azial rotation group. 
This is clearly Abelian. The full rotation group (Chapter IT) consists 
of all proper rotations about all axes through a point, and this 
becomes the full rotation and reflection group when all improper 
rotations are included. There are thirty-two groups of particular 
interest formed from a finite number of particular rotations about a 
point and are known as point-groups (§ 16). These clearly do not 
include all possible finite groups of rotations because, for instance, 
the rotations by 360 r/n degrees about a fixed axis where r = 1 to n, 
always form a group of n elements. An example of a point-group 
is the group (4.1) which is called 32 (pronounced “three two”, not 
“thirty-two”) in the international notation, to denote that it 
includes some two-fold axes (rotations by 180°) perpendicular to a 
three-fold axis (120°, 240°). All the proper and improper rotations 
that move a cube to an equivalent position form the full cubic 
group m3m. In the older Schoenfliess notation these two point- 
groups are called D, and O,. All square matrices of a given order 
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and with non-zero determinant form a group, the law of combination 
being matrix multiplication. So do all unitary matrices (appendix 
A) of given order, and likewise all unitary matrices of given order 
and with determinant +1, as can easily be verified. Finally linear 
transformations of co-ordinates can form groups as we shall now see. 

The group of symmetry transformations of a Hamiltonian. 
Consider three protons fixed at the points 


r, = [0, 21/3a, 0], or, =[—3a, —+/3a, 0], 
r, = [3a, —1/3a, 0] (4.11) 


forming an equilateral triangle about the origin (Fig. 3). The 
Hamiltonian for one electron moving in the field of the three protons is 
2 2 e2 


fee de pine cern eye ony a 
=~ om * r—r)' |r 


—_—. 4.12 
|r —r,| —r,| (S12) 


This system is not one of physical importance but its symmetry is 
closely related to that of an ozone moleculet or that of an ion situated 
between three water molecules in the hydrated crystal of a salt, to 
which the following discussion can easily be extended (cf. problems 
4.5 and 4.6). The physical system of three protons has the same 
rotational symmetry as the equilateral triangle already discussed, 
which suggests that the linear transformations EZ’, A’, B’, K', L’, 
M’ corresponding to the rotations E, A, B, K, L, M (4.1) may be 
symmetry transformations of the Hamiltonian (4.12). These trans- 
formations can easily be found from (2.2) and simple extensions of 
the argument of §2. For instance A’ is obtained from (2.2) by putting 
a = —120° in accordance with § 2 because A (4.1) is a physical 
rotation of +120°. We obtain: 


: (t, y, 2) = (X, Y, Z) 
>a = —4X + 4,/3Y B’. w= —4X —4,\/3Y 
y = —3V3X — 3Y y = 3V3X — $Y 


5g —Z 
: (4.13) 


: (a, ¥y; z) = (—X, Y, —Z) 

>u=$X44/3Y M': « = 4X — 34/3Y 
¥ = $V/3X — 3Y y = —3V/3X — $Y 
2=-Z z=-—-Z 


} Throughout this book we shall assume for illustrative purpose that the 
three oxygen atoms in ozone form an equilateral triangle, though this is 
not so in actual fact (see end of § 24), 
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It can easily be verified that all these transformations are indeed 
symmetry transformations of (4.12). The V? remains invariant 
under each as in § 3, and applying for instance A’ to the other terms 
we obtain 


lr — |? = 2? + [y — 2/30}? + 22 
= (-4X + 4V3Y)* + [v3 — FY — 23a}? + 22 
— (X + 3a)? 4[Y + 3a}? + Z? 
= |R — r,|?, 

jr—r|?=|R—r,|? and |r —r,/? =[R —r,|?, 


whence (4.12) is invariant under A’. 

The result that the transformation A’ of (4.13) is a symmetry 
transformation of (4.12) is actually no accident and can be proved 
as follows using the corresponding rotation A without ever writing 
down the form of A’ or substituting in (4.12). Let the potential in 
(4.12) due to the protons be 


e2 
V(r) = > [rF—r|’ 


1, 2,3 


and let P(x, y, z) be any point. Consider the physical operation A 
of rotating the point P and the three protons but not the co- 
ordinate axes. The system of protons is moved into an equivalent 
position, one proton from r, to r,, one from rz to r, and one from 
r, tor;, and P is moved to the position (X, Y, Z). During this 
rotation the potential at P has remained constant because it depends 
only on the distances of P from the three protons, and these dis- 
tances have not changed because P and the protons have been 
rotated as a rigid whole. Thus V (x, y, z) due to the initial charge 
distribution is equal to the potential at (X, Y, Z) due to the final 
charge distribution. Since, however, A has moved the system of 
protons into an equivalent position, the initial and final charge 
distributions and potentials are identical, so that the potential at 
(X, Y, Z) due to the final charge distribution is V(X, Y, Z), i.e. we 
have 


Viz, y, 2) = V(X, ¥Y, 2), (4.14) 


where according to Fig. 1 of § 2,2, y,z and X, Y, Z are related by 
equations (2.2) with « = —120°. It only remains to view (4.14) 
and (2.2) in terms of a change A’ of co-ordinates, rather than in 
terms of physical rotations. This only involves a change in the 
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interpretation of (4.14) and (2.2) and does not destroy their validity 
as correct mathematical relations. We therefore obtain that V(r) 
is invariant under the co-ordinate transformation A’. This argument 
applies similarly to all the transformations (4.13), and indeed to 
any similar situation (cf. problem 4.8). 

We can also verify from (4.13) or prove by the above type of 
argument that the transformation A’ followed by K’ is the same 
as the single transformation L’. For in detail this simply means 
that first expressing a function f(z, y, z) in terms of X, Y, Z using 
A’ (4.13) and then in terms of &, 7, ¢ using K’ 


(X, Y, Z) = (—&, > —?), 


gives the same result as expressing it directly in terms of &, », ¢ 
using the combined transformation 


a= hi + V/3/2 
y = 43/3 — yy 
z=—f, 


namely the transformation L’, In symbols K'A’ = L’. Similarly, 
any of the transformations (4.13) can be combined, and their 
multiplication table is exactly the same as Table 1 for the point- 
group 32 of rotations as can be verified most easily by matrix 
multiplication. It is also easy to show that the transformations 
have all the other properties (a) to (e) above required for them to 
form a group, which we shall call the point-group 32 of transfor- 
mations. 


THEOREM. We shall now generalize this result and prove that 
the symmetry transformations of a Hamiltonian always form a group. 
Suppose a Hamiltonian # is invariant under each of two sym- 
metry transformations F and S. We shall first show that the 
combined transformation SF (first F, then S second) is also a 
symmetry transformation. Let the co-ordinates 21, 1, 21, Ta. Yo. +> 


zn of the Hamiltonian be written for convenience q;, qo, - - - Yn, 
and let F be the transformation 
n= Fy (4.15) 


when written in terms of the summation convention (appendix A), 
and S be the transformation 


ge = SyQ). (4.16) 
Now the transformation SF means to substitute first for the q; 
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in terms of the Q; using (4.15) and then to substitute for the Q; 
further in terms of some new variables »; where 


Qi = Sur. (4.17) 
Since F and S are both symmetry transformations, 
(MN) = #(Qi) = #(), (4.18) 


so that the composite transformation SF from the q; direct to the 
is also a symmetry transformation. Thus the symmetry trans- 
formations satisfy the group requirements (a) and (b) above. We 
can indeed write down the transformation SF explicitly by eliminat- 
ing the Q; from (4.15) and (4.17), ie. SF is 


Gs = (FieSug) ry. (4.19) 
Further we always have the identity transformation 
a%=% =%$‘#=1 to 3n, (4.20) 


having the property (4.8) of the unit element #, which verifies (c). 
As regards (d), if we substitute for the q; in the initial Hamiltonian 
in terms of the Q; using F (4.15) and obtain #(Q;), then we can get 
back to 4#°(q:) by solving (4.15) for the Q; and substituting into 
#(Q). But this is just applying the transformation F-} 


O = (F ua (4.21) 


which undoes the effect of F, and this is therefore also a symmetry 
transformation. In (4.21) the Q; are now the initial variables and the 
q; the new ones, and F-! is the inverse of the matrix Fy;. To make 
the argument quite rigorous, we note that all the transformations 
in which we are interested are unitary (cf. appendix A, problem 
A.9), whence | #| 40 and (4.15) can actually be inverted to give 
(4.21). This verifies (d), and (e) can easily be verified by writing 
out the transformation TSF 


a = FuSeiTyQ 


in full without using the summation convention and noting that it 
does not matter where the brackets of (4.10) are inserted. This 
proves the theorem. It is now possible to give a precise meaning 
to the expression “the symmetry properties of a Hamiltonian” 
which has been used in a descriptive way up till now. The symmetry 
properties of a Hamiltonian consist of the group of all symmetry 
transformations of the Hamiltonian. 

We shall now investigate the group of symmetry transformations 
of the Hamiltonian (4.12) in greater detail. Out of the six elements 
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(4.13), the elements H’, A’ and B’ form a group in themselves as 
can easily be seen from the first three rows and columns of Table 1. 
These elements chosen from the bigger group (4.13) are said to 
form a subgroup of the larger group. Another subgroup of (4.13) 
is the group (Z’, K’), another one (H’, L’) etc. Similarly, (4.13) 
does not include all the possible symmetry transformations of the 
Hamiltonian, but is a subgroup of the group of all its symmetry 
transformations. For instance, a symmetry transformation not 
included in (4.13) is the reflection 


(x, Yy; 2) as (=X; Y, Z) 


in the plane x = 0, i.e. in the plane kOz (Fig. 3). Further symmetry 
transformations are the reflections in the planes 10z, mOz and 
z = 0, and combinations of these with the rotations (4.13). These 
all together form the group of 12 elements called the point-group 
6m2 of transformations, and they would appear to be all the sym- 
metry transformations that the Hamiltonian (4.12) has. Incident- 
ally, there is no certain method of ensuring that one has found all 
the symmetry transformations of a given Hamiltonian instead of 
just a subgroup: one can only try all the transformations one can 
think of. Some of the symmetry transformations may not be at 
all obvious, one case being the symmetry property of the pure 
Coulomb 1/r field in a hydrogen atom which gives rise to the degen- 
eracy of all levels with the same principal quantum number 
irrespective of their angular momentum quantum number /, unlike 
the more general situation in an alkali atom (Schiff 1955, p. 86; 
Fock 1935). Thus some of the more subtle symmetry transforma- 
tions of certain systems have only been discovered relatively 
recently (Jauch and Rohrlich 1955, p. 143; Baker 1956; problem 
24.11). If we now consider n electrons in the field of the three protons 
or of three identical charges of any magnitude similarly arranged, the 
Hamiltonian for this system would have the transformations 6m2 
applied to each set (21, yi, 21), 7 = 1 to n, as symmetry transforma- 
tions. It also has the n! permutation transformations of the 1 
variables 2; etc., and all combinations between the permutations 
and the point-group 6m2 transformations. Thus the group of all 
symmetry transformations would have a large number (12n!) of 
elements, but it would be a simple combination of the groups 6m2 
and Pn. 


Isomorphism 


It was shown above that the elements of the group (4.1), say the 
group &, and those of the group (4.13), G’ say, both multiply in 
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the same way according to Table 1. This relationship between © 
and ©’ is called an tsomorphism and can be described by saying that 
the elements #, A, B,... of G can be paired off with the elements 
E'", A’, B’,... of &’ such that the relationships between Z,.A, B,... 
as regards multiplication are in every way the same as the relation- 
ships between EH’, A’, B’,.... Actually it requires more care to 
define isomorphism precisely and to distinguish it from the related 
concept of homomorphism (cf. appendix B), but the above des- 
cription is sufficient for the present considerations. That © and 6’ 
have the same multiplication table is not accidental, but follows 
quite generally from the relationship between a physical rotation 
and the corresponding co-ordinate transformation. For if we 
apply to a system a rotation F which shifts P(x =q,, y = q, 
z= 4x) to P’(Q,, Qs, Qs) related by (4.15), and then a rotation S 
moving P'(Q,, Q2, Qs) to P"(v,, v2, v3) related by (4.17), then the 
combined rotation SF shifts P(q,, 2, 93) to P"(v4, ve, v3) related by 
(4.19). But we had previously seen that equations (4.15), (4.17) 
and (4.19) represents the combination of linear transformations 
of co-ordinates, which therefore combine in exactly the same way 
as the physical rotations, this argument being quite general. In 
connection with our study of the Schrédinger equation, it is always 
the group of symmetry transformations of the Hamiltonian that we shall 
be interested tn (§§ 2, 3). However, the group of physical operations 
which move a system into equivalent positions can be used as an 
aid to the imagination and to suggest what the symmetry trans- 
formations of the Hamiltonian are. Thus in future we shall not 
distinguish the transformations (4.13) by primes from the rotations 
(4.1) and shall simply refer to either group as the point-group 32. 
‘Another example of isomorphism is afforded by the permutation 
group P, of order 3, which consists of all the transformations (ijk) 


(24, Wy 21) ae (X;, Yi, Zi), 
(Xe, Yo, 2) = (Xj, Yj, 2), | 
(3, Y3, 23) = (Xk, Yu, Ze), (4.22) 


where ijk is some permutation of the numbers 1, 2 and 3. These 
transformations can again be combined according to equations 
(4.15), (4.16), (4.17) and (4.19), where Fy, etc., are now matrices of 
order 9 x 9. The transformation (132) (231) means as usual the 
permutation transformation (231) followed by the permutation 
(132), and it is equal to the permutation (321) as can easily be | 
verified. Similarly the whole multiplication table can be set up 
(Table 2). On examination, this table is seen to have exactly the 


bo 
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TABLE 2 
Multiplication Table for the Group Ps; 


nt 


(123) (231) (312) (132) (321) (213) applied first 
i ee 
(123) (123) (231) (312) (132) (321) (213) 
(231) (231) (312) (123) (213) (132) (321) 
(312) (312) (123) (231) (321) (213) (132) 
(132) (132) (321) = (213) (123) (231) (312) 
(321) (321) (213) = (132) (312) (123) (231) 
(213) (213) (182) (321) (231) (312) (123) 
applied 
second 
i 


same structure as Table 1. Indeed if we write in Table 2 


E instead of (123), A instead of (231), 
B instead of (312), K instead of (132), 
L instead of (321), M instead of (213), 


then Table 2 becomes identical with Table 1, and the group fp, is 
thus isomorphic with the point-group 32. This situation often arises 
when one is considering groups with a small number of elements, 
but it must not be assumed that every group of six elements is 
isomorphic with the point-group 32. For instance, the group of 
numbers 


1, exp(iz/3), exp(#27/3), —1, exp(i47/3), exp(i57/3) (4.23) 


is not, the law of combination being multiplication (cf. problem 
4.2). However, it can be shown that any group of six distinct 
elements is isomorphic either with the point-group 32 or with the 
group (4.23). 


References 

All the matrix algebra required for this book is given in appendix 
A. Accounts of groups and their properties, more detailed than 
here but still introductory, may be found in Lederman (1953), 
Birkhoff and MacLane (1941) and other texts (see general references 
at the end of the book). 


Summary 

We have defined what is meant mathematically by a group. 
We have proved that all the symmetry transformations of a 
Schrédinger Hamiltonian form a group. In the examples studied 
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so far, this group of symmetry transformations is directly related to 
the physical symmetry of the system to which the Hamiltonian 
applies. 


PROBLEMS 


4.1 Verify the multiplication table (Table 1) for the group of 
transformations (4.13). 

4.2 Construct the multiplication table for the group of elements 
(4.23) and show that it cannot be made to have the same form as 
Table 1 by pairing off each of the elements (4.23) with one of the 
elements (4.1) in some way, i.e. that the groups (4.23) and (4.1) 
are not isomorphic. Note that this result also follows directly from 
the fact that the elements (4.23) all commute whereas the elements 
(4.1) do not. 

4.3 Prove that all unitary matrices of order » x ” form a group. 
Prove that all those with determinant +1 form a subgroup of this 
group (cf. problem A.8, appendix A). 

4.4 What are two other simple arithmetic operations besides 
division for which the associative law (4.10) does not hold ? 

4.5 Write down the Hamiltonian for all the electrons in an 
ozone molecule omitting spin dependent terms and considering the 
nuclei as fixed in the form of an equilateral triangle. Show that the 
Hamiltonian is invariant under the point-group 6m2 of transforma- 
tions applied simultaneously to the co-ordinates of each electron. 

4.6 In the hydrated crystal of an inorganic salt, three molecules 
of water of crystallization are arranged in an equilateral triangle 
at a distance a from the centre with their permanent electric dipole 
moments pointing in towards the centre. Calculate the potential 
V(r) near the centre in the form of a power series in x, y, z up to 
the cubic terms with the centre as origin, expressing it in the form 


A, + Ag(a? + y? + 2%) + Age? + Agi[(x + ty)? — (x — ty)¥]. 


Hence verify that V(r) is invariant under the point-group 6m2 to 
this order of approximation, and show that this is also true for 
the exact expression for V(r). Write down the Hamiltonian for an 
atom at the centre of the triangle and show that it is also invariant 
under 6m2. 

4.7 In the benzene molecule, the carbon nuclei lie in a plane in 
the form of a regular hexagon. Discuss the symmetry of the Hamil- 
tonian for the electrons in a benzene molecule. 

4.8 Write out in clear English a proof of the following theorem. 
If a given distribution of charges remains unchanged by a group of 
physical rotations, then the potential V(r) due to the charges remains 


3 
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invariant under the transformations corresponding to the rotations 
in the sense of § 2, and these transformations also form a group. 

4.9 Which of the point-groups 6, 6, 3, 3m (§ 16) are isomorphic 
with the group (4.1) and which with the group (4.23) ? 

4.10 The elements A, B, C,... of a group © are all distinct. 
If one element P is combined with each element of 6, show that the 
set of combinations PA, PB, PC, ...is identical with the group 6. 
Hence show that each row and each column of the group multi- 
plication table contains all the elements of the group once. Hint: 
every combination is equal to some element of © by the group 
property (b); it is only necessary to show that all the combinations 
are distinct, for then each element of © must occur exactly once 
among them. (Ledermann 1953, pp. 10, 32). 

4.11 Are the point-groups 32 and 6m2 subgroups of the full 
rotation group? Is Py a subgroup of PD» if n < m? 

4.12 Prove that all groups consisting of two distinct elements 
are isomorphic. Do the same for all groups of three distinct elements. 
Hint: show that the multiplication table can only have one form by 
using result of problem 4.10. 


5. Group Representations 


Transformation of functions 

In this section we shall study the effect of symmetry transforma- 
tions on wave functions or other functions. They do not behave 
in a simple way like the Hamiltonian does, and it will be necessary 
to introduce several new concepts. 

In § 2 we defined applying a transformation R like (2.2) to a 
function y,(z, y, z) to mean substituting the expressions (2.2) for 
x, y, z into ¥,(z, y, 2), and obtaining in general a new function 
wo(X, Y, Z). Further in § 3 we saw that if y,(z, y, z) is an eigen- 
function of a Hamiltonian #(x, y, z) (ef. equation (3.7)) and Risa 
symmetry transformation of this Hamiltonian, then ¥,(X, Y, Z) 
is an eigenfunction of #(X, Y, Z) (cf. equation (3.8)) with the same 
eigenvalue. Now to carry on using capital letters would be incon- 
venient and we might just as well say that ¢,(z, y, z) is an eigen- 
function of #(z, y, z) (cf. equation (3.9)). This reversion to lower 
case letters is particularly useful in discussing the relationship 
between the functions ¥, and 4. Accordingly we shall in the future 
short circuit the use of capital letters and say that applying the 
transformation R to y,(z, y, z) changes it into ¥,(z, y, z). This is 


written 
Ry, (z, y, 2) = po(x, y, 2). (5.1) 


SYMMETRY TRANSFORMATIONS 25 


y. is called the transformed function, and we say that R operating 
on ys, gives Ys, or R transforms ys, into ,. In more general terms we 
have the new definition: applying a transformation T given by 


un = TyQ; (5.2a) 


to a function f (qi) means first substituting (5.2a) for the q and thus 
obtaining a new function F(Qi) = f(TyQ;) of the Qi, and secondly 
replacing each Qi by qi which gives F(qi) =f (Tyq;). In symbols 


TH (a) = f(T). (5.3) 


In this notation the example of equation (2.3) becomes 
R(a, z) (x2 — y)*? = [x(cos « — sin «) — y(cos a + sin «)]?, 
and (5.2a) itself can be written in the form 
Ta = Ty. (5.2b) 


The transformation (5.2) can also be applied to differential operators 
as in §§ 2 and 3, so that in the new notation 7 is defined to be a 
symmetry transformation of a Hamiltonian # ift 


TH (41) = 4H). (5.4) 


Representations of a group 


Consider the transformations (4.13) operating on the functions 
x exp(—r), y exp(—r) where r? = x? + y? + 22, 


For instance in terms of the notation (5.3) and dropping the prime 
on A in accordance with § 4, we have 


1 
Axer = — 3 ver + ve ye, 


(5.5) 


t Sometimes it is convenient to adopt a different notation from (5.4) for 
a transformed operator P. We write formally 
T(Pyp) = (TPT)(TY), (5.4t) 
and define TPT-1 to be the transformed operator, which operating on the 
transformed function 7, gives the transformed product T(Py). This notation 
will only be used on a few occasions in later sections. 
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Thus using the transformation A, we have generated the matrixt 


hs We 
2 2 | 
Djj(A) = . (5.6) 
—V/f3 1 
ne 


where we have transposed the matrix of the coefficients in accor- 
dance with a convention to be explained below. In this way we can 
generate all the matrices shown in the first row of Table 3. If these 
matrices are multiplied together, it is seen that they have the 
property of multiplying according to Table 1. For instance 


Dix(K) Dyy(A) = Dy(L), 


corresponds to the relation 
KA=L. 


Because the matrices multiply in this way, they are said to form a 
representation of the group 32. This particular representation we 
shall call I’ (Table 3). More generally, the square matrices Dy(A), 
Diy(B), . . . form a representation of a group ®(A, B, .. .) tf the 
matrix Diy(C) associated with any combination C =SF is always 
equal to Dix(S) Dxj(F). In terms of the nomenclature of appendix 
B, we can simply say that matrices of a representation form a group 
which is homomorphic to 6. If the matrices are of order n x n, 
the representation is of dimension n. 

We can also operate with the transformations (4.13) on the 
function z exp(—r) and generate the representation ~@ of 1 x 1 
matrices (Table 3). The fact that the matrices are not all different 
does not prevent them from forming a representation, for it is 
easy to verify that they have the essential property, namely, if 
SF =C then D(S) D(F) = D(C) in every case. For instance 
KA=L and 1 x (—1)=-—1. The identity representation ¥ 
(Table 3) is also one-dimensional and consists entirely of ones, and 
could for instance be generated by operating on the function 
(x? + y*) exp(—r) (cf. problem 5.2). Clearly every group has such 
an identity representation. If the matrices of a representation 
(such as I’) are all different, the representation is called faithful: 
if not all different (such as ox or 4), it is unfaithful because we can 
find spurious relations such as D(A) D(K) = D(L£) which do not 
correspond to any relation in Table 1. 


{7 All the matrix algebra and terminology which is assumed in the text is 
contained in appendix A. 
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Although we have introduced representations by transforming 
some functions, a representation is just a group of matrices with 
the right multiplication properties. Indeed the representations of 
Table 3 might have been found by inspection of the multiplication 
table of the group 32 (Table 1) without considering the group 
elements as linear transformations and without operating on any 
functions. Thus in the definition of a representation there is no 
mention of the functions that might have been used to obtain the 
representation. Nevertheless, in this book we are exclusively con- 
cerned with groups of linear transformations and with their effects 
on wave functions and other functions, so that in practice repre- 
sentations always will arise in the way they have been introduced, 
namely, through the transformation properties of some functions. 
To see this in a general way, consider a linearly independent set of 
functions ¢,, do, . . . dn and a group © of linear transformations of 
which a typical one is 7’. If the functions and transformations are 
such that 7'¢; can in every case be expressed as a linear combina- 
tion of the set ¢,, $5, .. . dn, Le. 


T 4; = Diy(T) $i, | (5.7) 


then we have immediately the set of matrices D;;(7'). Furthermore 
these matrices form a representation of 6, for if C = SF, then 


Coy = SFO; = SDyy(F)bx = Drj(F)She = Dix(8)Diy( F) de 


so that by (5.7)t 
| Diy(C) = Dix(S)Dej(F). | (5.8) 


The functions ¢$,, $a, ..- dn are said to form a basis for the representa- 
tion Diy(T). We have said “a” basis for there are usually other sets 
of functions that give rise to the same matrices (cf. problems (5.2), 
(5.4)). However, we have used the words “the” representation 
because the matrices are uniquely determined. An alternative 
wording is to say that the functions $, ¢o, . . . dn transform according 
to the representation Di;(T) of &. Thus in our first example (equations 


+ Although it might at first sight appear more natural to define the matrices 
of a representation by 7'¢i = Di;(T')d; instead of (5.7), this would lead to the 
relation Dij(C) = Dix(F)DeS) which is the opposite of our convention 
(4.2) for the multiplication of group elements. This explains the necessity 
for using the transposed matrix of coefficients in (5.5), (5.6) and in (5.7). 
It may be helpful to write (5.7) in the form 714; = ¢$:Di;(T), the set of functions 
$i behaving as a row matrix. 
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(5.5), (5.6)) 2 exp(—r) and y exp(—r) form a basis for the re- 
presentation I" (Table 3) of the group 32, or in other words they 
transform according to the representation I of the group 32. It is 
of course only special sets of functions that have the property 
(5.7) and hence can form a basis for a group representation. For 
instance the functions cos 2, sin x, cos y, sin y do not form a basis 
for a representation of the group 32, nor do z* exp(—r), y? exp(—r). 
The letters D (German, Darstellung) and I’ are most commonly 
used for representations. 
Problems 5.1 to 5.7 are based on the preceding concepts. 


Vector spaces 


Suppose now that x exp(—r) and y exp(—r) are eigenfunctions 
of a Hamiltonian belonging to the same eigenvalue. If there are 
no further linearly independent eigenfunctions, the level is said to 
be two-fold degenerate. Nevertheless, there are an infinite number 
of different eigenfunctions belonging to this level because any linear 
combination 


caer + coyeT (5.9) 


where c,, C; are complex numbers is also an eigenfunction. All 
such functions together are said to form the vector space [x exp(—r), 
y exp(—r)], and it is often more convenient to consider the whole 
vector space rather than a particular pair of functions as being 
associated with the energy level. More generally, the vector spacet 


R (d}1, Go, - » » dn) consists of all functions 

C1, + Copy eae ies Cnbn (5.10) 
which are linear combinations of the $1, ba, . . . dn with complex 
coefficients. The functions ¢,, dg, . . . dn are the base vectors in the 


vector space, and they are also said to span the space R to denote 
that any function belonging to R can be expressed as a linear 
combination of the ¢;. A function (5.10) belonging to & is a vector 
in or of the space R. Now any particular vector space can be 
referred to different sets of base vectors. For consider all the functions 
of the form (5.9). They could equally well be expressed in terms of 
different base vectors (# + ty) exp(—r) in the form 


c!,(a + iy)et + c's — iy)e (6.1L) 
where c’, = 3(¢, — tc), C's = $(c, + %c,). Thus the two vector 


+ German, Raum. 
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spaces (5.9), (5.11) are equal and we can write | 

(ue, yo") = [(w + ty)e, (@ — ay)e~7]. 
Similarly by choosing other linear combinations, the base vectors 
for this vector space can be chosen in an unlimited number of ways, 
and the same applies to the general vector space (5.10). 

There is an important property of a vector space which is inde- 
pendent of the way the base vectors are chosen, and this is its dimen- 
sion. Consider again the set of functions ¢,, $,,...¢n. They are 
said to be linearly independent if it is impossible to find complex 
numbers «1, %, .-. a such that 


ayy + a%b_ +. --endn = 9, (5.12) 
i.e. if it is impossible to express one of the functions as a linear com- 
bination of the others. Thus x exp(—r), y exp(—r) are linearly inde- 
pendent functions, but x exp(—r), y exp(—r), (x + zy) exp(—r) 
are not because we have the relation 

—ae-* — iyeT + (x + ty)et = 0. 
In fact it can easily be shown (problem 5.11) that it is always 
necessary to have precisely two linearly independent base vectors 
to span the whole space [x exp(—r), y exp(—r)]. More generally it 
can be proved (appendix C, Theorem 1) that the number 1 of 
linearly independent base vectors required to span the whole of a 
given vector space is the same no matter how the base vectors are 
chosen, so that n is a well-defined property of the vector space, 
and is called its dimension. Thus we have the definition, the dimen- 
sion of a vector space is the number of linearly independent base 
vectors required to span the space. In the future we shall assume 
that the base vectors of a space have been chosen linearly indepen- 
dently unless the contrary is indicated. We shall also assume for 
convenience that they are orthogonal to one another and normalized. 
This involves no loss of generality. For suppose we are given a set 
of non-orthogonal base vectors ¢,, $2, . . - én. We can construct 
an orthogonal set ¢’,, ¢’2, - - - ¢’n a8 follows. Put 4’; = ¢,, then 
$’, = $, + ag’, and determine a such that ¢’, is orthogonal to ¢';. 
Then put ¢’s = $3 + 014’, + 5o¢’, with 5, and 5, such that ¢’, is 
orthogonal to ¢’, and ¢’,, ete. The functions ¢’;, $’s . .. ¢'n can 
then also be normalized. 
Problems 5.8 to 5.13 are based on the above concepts. 


Transformations in a vector space 
We shall be interested almost exclusively in vector spaces R 
(¢1, $a). - $n) in which the base vectors ¢; have the property that 
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for every transformation T of a group ©, T7'¢; is expressible as a 
linear combination of the dt, Le. . 


Ty = Dy(T) $e. (5.7) 


More concisely, 7'¢; always belongs to R. Then this property holds 
not only for the base vectors ¢; but for any vector ¢ = cy¢y, for 


Th = Toy = c7Dy(T) G4 


and thus 7'¢ belongs to R. Thus any transformation of 6 turns 
every vector of R into another vector also belonging to R, and we 
say that the space R is invariant under the group © of transformations. 
If R is a one dimensional space (¢,), the vector ¢, is said to be an 
invariant vector. This implies that 774, = c(T)¢, but the constant 
c(T) is not necessarily unity. 

There is another way of looking at transformations of functions 
in the vector space R which we shall mention briefly but have no 
occasion to use except in §§ 20 and 32. Consider any function 
d = cjg, and let T'¢; = ¢'; given by (5.7). Now the ¢'; can be 
used as new base vectors in the space, and the untransformed 
function ¢ can be expressed as a linear combination of them with 
coefficients c’;, i.e. 


b = Cghy = 0'5$'s. (5.13) 
The question is, how are the coefficients c’; related to the cj$? From 
(5.13) 
o's’ = CD(T) hi = cas, 
whence oo = Dy(T e's 
or cy = [ D( T)]g—tey, (5.14) 


where [D(T)]~! is the reciprocal of the matrix D(T). The difference 
between (5.7) and (5.14) is expressed by saying that the ¢; trans- 
form as base vectors and the c; as coefficients according to the 
representation Dj;(T). 


Equivalence 


We have seen that the functions x exp(—r), yexp(—r) form a 
basis for the representation I (Table 3) of the group 32. If now 
in the vector space [xz exp(—r), y exp(—r)] we choose different base 
vectors, say (x + iy) exp(—r), then we can use the matrices of I" to 
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determine the representation according to which the new base 
vectors transform. For instance from (5.5) 


A(x + iy)je? = Axe + 1Aye" 
- (- b- iS) + iy)err, 


and in this way the matrices of the new representation can be written 
down. More generally, consider a vector space R which is invariant 
under a group © of transformations, so that the base vectors ¢,, de, .. . 
gn of BR form a basis for the representation D,;(7') according to 
(5.7). The question now arises, if we choose new base vectors ¢’;, 
say, what representation do they transform according to? The new 
base vectors can be expressed in terms of the ¢i, 


$'5 = Pudi, 
since they belong to R. Also 
$j) = Py *d's. 


Then 
T¢'3 = TPydi = PyDeT) ox = Pix De T) Py d's 


Thus the ¢ transform according to the representation D'y(T) 
where in matrix form 


D'(T) = P“D(T)P, (5.15) 


and this representation is said to be equivalent to the representation 
Di(T). Definition: two representations Diy(T) and D'y(T) of a 
group are equivalent if a matrix P exists that relates the matrices of 
the two representations in the manner of (5.15). We shall refer to 
(5.15) as the equivalence relation or as an equivalence transformation. 

The equivalent representations Dj,j(7') and D'(7’) are in fact so 
closely related that we shall usually not wish to distinguish between 
them. It is then convenient to use the expression the representation 
D to mean any or all of the representations that are equivalent to 
Dy(T). In fact if two representations D(T’) and D’(T) are equivalent, 
we write 


D= I]. 


Thus we can say that the vector space ® transforms according to 
the representation D of R without having to specify a particular 
set of base vectors or the actual representation. On the other hand 
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when we want to discuss a particular set of base vectors ¢,, ¢o,... dn, 
we shall say that they transform according to the particular representa- 
tion Di(T). If the matrices of two particular representations D(T) 
and D’(7’) are equal, we shall denote this by writing 


Diy(T) = D'y(), 


and shall describe the two representations as identical. Thus although 
we have so far used I to denote the particular representation given 
in Table 3, we may in the future also use it to refer to any repre- 
sentation equivalent to this one. It will also be convenient to use 
“different representations’ to mean different non-equivalent 
representations, unless it is clearly specified that different particular 
equivalent representations are meant. 


Reducibility of a representation 


Consider the vector space R, [x exp(—r), y exp(—r), z exp(—r)] 
which is invariant under the group of transformations 32, so that the 
base vectors zexp(—r), y exp(—r), 2 exp(—vr) form the basis of a 
representation 4, say. Here 4 is a combination of I’ and » (Table 
3), the matrix corresponding to a transformation 7' being 


0 
Py(T) } 
0 
A(T) =| ------ Eredar (5.16) 
0 0! xT) 
] 
1 


Such a representation is said to be reducible, in particular 4 is 
reducible into I and »/, which is written 


4=r+. 


If in the vector space R, we had used the base vectors (x + y) 
exp(—r), (y +z) exp(—r), (2 +2) exp(—r), the matrices of the 
corresponding representation 4’, say, would not have the simple 
reduced form (5.16), but we still say that 4’ is reducible into 
and I and we write 4’ = I’ + &, because the form (5.16) can easily 
be achieved by choosing different base vectors, i.e. by applying 
a transformation of the type (5.15) to the representation. In general, 
a representation Djj(T') of a group & is reducible into the representa- 
tions D', D®, ... D'*) if a transformation of the type (5.15) exists 
which brings every matrix Dy(T) of the representation into the form 
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7] 
0 | pawn =O 


0 0 


with smaller matrices D“(T) along the diagonal and zeros elsewhere. 
This is written 


D = DY + DM 4+ DM 4+... + De), (5.18) 


and D is said to contain the representations D® or the D™ occur 
in D. Tn (5.18) it is quite possible for some of the components to 
be equivalent or equal to one another, i.e. to occur in D more than 
once. For instance if we apply the group 32 of transformations 
(4.13) to the two sets of co-ordinates 7,, y,, Z, and X,, Y2, 2, simul- 
taneously, then the functions 


% exp(—r,), 2, exp(—r,), ¥, EXP(—"1)), Yo Exp(—T2) 


transform according to a representation which is clearly equivalent 
tolr+TI. 

A representation that cannot be reduced is termed irreducible. 
For instance any one-dimensional] representation is automatically 
irreducible, and we shall prove at the end of this section that the 
representation I’ (Table 3) is irreducible. In fact the representa- 
tions of Table 3 (and those equivalent to them) are the only possible 
irreducible representations of the point group 32 (ef. § 14). Thus 
every representation of the group 32 can be reduced into a sum of 
the representations I’, < and ¥, an irreducible representation being 
regarded as already in reduced form. It can also be shown quite 
generally (appendix C, Theorem 3) that if a representation D is 
reduced into irreducible components according to (5.18), the reduction 
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is unique apart from equivalence; i.e. which D occur in D and 
how many times each one occurs is unique, but the order in which 
the matrices Dy‘(7’) are arranged in (5.17) and the particular 
elements Dy‘?(7') are not unique. 


Reducibility of a vector space 


So far we have discussed what the reducibility of a representa- 
tion means purely in terms of the matrices of the representation. 
Consider a vector space X (¢,, ¢y, - . . dn) of which the base vectors 
¢; transform according to the reducible representation Djj(7') of 
(5.18). A vector space transforming according to a reducible (or 
irreducible) representation is called a reducible (or irreducible) vector 
space, and we shall now discuss what the reducibility of the space 
R implies. Since the representation Dy(7') is reducible, there exists _ 
by definition some transformation of the type (5.15) which brings 
the matrices of D into the reduced form (5.17). Such a transforma- 
tion is equivalent to choosing new base vectors 4’; in R, and these 
¢'« therefore transform according to the matrices (5.17). Let R™ 
be the space spanned by the first , vectors ¢; where n, is the 
dimension of the representation D@). From the form of (5.17), 
R™ transforms according to the representation D“, and similarly 
successive sets of vectors span spaces R transforming according 
to the representations D™, Thus it follows directly from the 
reducibility of the representation Djy(7) that the vector space R 
can be split up or reduced into a series of invariant subspaces R‘ 
transforming according to the representations D). Expressed more 
descriptively, the transformations of © transform the functions of 
each subspace 8‘ into one another without mixing together the 
functions belonging to different subspaces. The importance of this 
arises very briefly as follows. In §3 we saw that symmetry trans- 
formations change the eigenfunctions belonging to the same eigen- 
value into one another. If now we consider all the eigenfunctions 
of a Hamiltonian as one big vector space R, then the functions 
belonging to different invariant subspaces R@ are in general 
associated with different energy levels, as we shall discuss in detail 
in the next section. 

Since the ¢’; span the whole space &, any function in the space R 
can be written as the sum of functions, one from each subspace R™, 
This is written in symbols 


R= RY +4 RM 4... + RY, (5.19) 


It can also be shown (appendix C, Theorem 2) that the reduction 
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of R has the further important property that the subspaces R‘# 
can always be made orthogonal to one another, i.e. any function 
from one RX is orthogonal to any function from any other R™, 
Physically this corresponds to the fact that eigenfunctions belonging 
to different energy levels of a Hamiltonian are always orthogonal. 
It is also possible to establish the criterion (appendix C, Theorem 
2) that if a space R contains a subspace, then R must be properly 
reducible. 

The various points above are illustrated by the following example 
of a reducible vector space. Consider the space &,(x?, y?, 2%, yz, 
zx, xy) where to preserve the symmetry the base vectors have been 
neither orthogonalized nor normalized. This space is invariant 
under the group 32 of transformations (4.13) since Ry consists of 
all functions of the second degree, so that the base vectors 2?, 


y”,... xy transform according to some representation Dy(T), say. 
For instance the matrix Djj(A) is 

4 3 : : : 4/3 

: : 3 ; 43 

: . 1 . , ‘ 

—-t 433 
; ‘ —4V/3 —} : 
—ivV3 4/3 —4 


This does not have the reduced form (5.17) and neither do the other 
matrices. However, the space R, must be reducible because by 
inspection it contains an invariant subspace, namely the vector 
x? + y? + z? transforming according to the identity representations 
JF (Table 3). In fact it can easily be verified that the space is reduced 
by choosing the new base vectors 


by = 2ary ¢, =e —y¥? 

b's = 2yz p', = — 22x 

b's = (2/15?) (a? + y? + 2?) 

$’g = (1/3?) [322 — (x? + y? + 27], (5.20) 


which are all orthogonalized and normalized to 167/15 as regards the 
angular variables. For instance any transformation of the group 32 
operating on a linear combination 


Cf’ + Co’, 
gives another linear combination of the same form without intro- 
ducing any components involving the other base vectors. Thus 
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¢';, % 2 Span an invariant subspace, and it may easily be verified 
that they transform according to the irreducible representation I" 
(Table 3). Similarly ¢’3, ¢’, transform according to I’, ¢’; according 
to £, and ¢’, according to ¥%. The whole set ¢’,,. . . 4’, therefore 
transforms according to a representation with matrices D’y(T) of 
the form 


P'y(T) 
ae y(T) 
Se eee eRe - im 

JF(T) 

Thus we have 
D=PT+TrH+JV+4+Y, (5.21) 

and R = ($'1, $'2) + (4's, $'a) + (G's) + ($'e)- 
Irreducibility of I" 


We have so far discussed some of the properties of reducible and 
irreducible representations and vector spaces, without indicating 
how in practice one might reduce a given representation or prove 
it to be irreducible as the case may be. We shall see in § 14 that 
there are very elegant general methods for doing this by setting up 
all the different (i.e. non-equivalent) irreducible representations of 
a group, and testing whether each of them is contained in the given 
representation. However, as an illustration of the concepts of this 
section, we shall give an ad hoc proof from first principles that the 
representation I" of Table 3 is irreducible. 

Let ¢,, ¢, be two vectors transforming according to the particular 
representation I4j(7') of Table 3. If I were reducible, it would have 
to be reducible into two one-dimensional representations, so that it 
would be possible to find two vectors in the space (¢,, ¢.) each of 
which is invariant under the group 32. It is therefore sufficient to 
prove that there is no vector 


p = Co, + Code (5.22) 


which is invariant under all transformations of the group. 
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Since ¢,, ¢, transform according to the representation Iy;(7'), we 
have from Table 3 


K¢, = —¢y Ko, = $2. (5.23) 
If now we have a relation of the form 

Cyp1 + Cob, = C'rp1 + C'2ba, 
then this implies c, == c’,, C2 = ¢’,; a8 otherwise we would have 


, 
Sees 
Cy cy = 


dy = 


which substituted into (5.23) clearly leads to a contradiction. 
Let us now suppose that there exists a vector ¢ (5.22) which is 
invariant under all transformations of the group 32. This means 


that 
K$ = y¢ 
where y is some constant, i.e. from (5.23) 


K(cydy + Cofe) = vlads + Corpo) = —\h, + Coo. 


Therefore —(y = yt, and Cy = V0; 
whence either y= -l, c¢, #90, c, = 0, 
or y = 1, c¢, = 0, Co # 0: 


i.e. the invariant vector has to be either 4, or ¢.. However, from the 
matrix Iyj(A) we see that neither ¢, nor ¢, is invariant under the 
transformation A, which gives a contradiction. Hence no invariant 
vector exists and the representation I’ is. irreducible. 


References 


In appendix C, the theorems about representations and vector 
spaces which have been quoted in this section are proved rigorously. 
However, in keeping the proofs as elementary as possible, much 
of the beauty of the theory has been lost, and the reader is referred 
to Van der Waerden (1932) for an elegant treatment, as well as the 
other books mentioned in the list of general references immediately 
preceding the bibliography at the end of the book. Birkhoff and 
MacLane (1941) give an elementary discussion of vector spaces 
which covers some aspects of the present section in more detail. 


Summary 


We have defined what is meant by a representation of a group 
and by a vector space. We have shown how a vector space which is 
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invariant under a group of transformations forms a basis for a 
representation of the group. Using different sets of base vectors in 
the space gives equivalent representations. Vector spaces and the 
corresponding representations may be reducible or irreducible. 
A reducible vector space can be split up into a sum of irreducible, 
mutually orthogonal subspaces. 

This practically completes the number of new mathematical 
concepts required in this book. 


PROBLEMS 


Note: Throughout these problems the group 32 is the group of 
transformations (4.13), D, is the group of permutation transforma- 
tions (4.22) of order 3, and & is considered to be a general group 
of linear transformations. 

5.1 Write the answers to problem 2.1 in terms of the new defi- 
nition of transforming a function and the notation of equation (5.3). 

5.2 Verify in detail that the functions xexp(—r), y exp(—r) 
transform according to the particular representation I’ of Table 3 
under the group 32, and that zexp(—r) and (x* + y?) exp(—r) 
transform respectively according to the representations .~ and ¥. 
Write down some other functions transforming according to .~/ and 
J. Verify that the matrices of each of these representations multiply 
according to the group multiplication table (Table 1). 

5.3 Show that the functions x + y, x — y transform according 
to a representation of the group 32. Write down explicitly some of 
the matrices of this representation, and verify that they multiply 
according to the group multiplication table. 

5.4 Show that 27y and x? — y? transform according to the 
representation I” (Table 3) of the group 32. Show that yz and 
—zx do likewise. ¢, and ¢, transform according to I: if ¢, = xyz, 
what is ¢,? 

5.5 Show that the functions x’, y?, z? do not form a basis for a 
representation of the group 32. 

5.6 Show that the matrices of each of the representations of 
Table 3 form a representation of the group P;, and identify the 
group element which each matrix represents. In this case the 
representations % and are called respectively the symmetric and 
antisymmetric representations. More generally if the groups ©, and 
&, are isomorphic with one another, show that a representation of 
the one group automatically forms a representation of the other. 

5.7 Show that the functions 2,2.y3 — %Yo%z and 742%, — YiXo%s 
transform according to a representation of the group P,; and write 
down explicitly some of the matrices of the representation. 


4 
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5.8 Write down five different pairs of functions that span the 
same space [xz exp(—r), y exp(—r)], the two functions of each pair 
being orthogonal and normalized to the same value. 

5.9 Show that the two functions of problem 5.7 are not ortho- 
gonal, and write down two functions spanning the same space that 
are orthogonal. 

5.10 Show that the vector space (x? — y*, y? — 2%, 22 — x?) 
has only the dimension two and not three. 

5.11 Consider the vector space [x exp(—r), y exp(—r)] and show 
from first principles 

(i) it is impossible to span the whole vector space with one base 
vector; 

(ii) any two vectors of the space always span the whole space 
provided they are not linearly dependent, i.e in this case they are 
not simple multiples of one another; 

(iii) three vectors in the space are always linearly dependent. 

5.12 Prove that the maximum number of linearly independent 
vectors that can be found in a vector space, is equal to the mini- 
mum number of base vectors required to span the whole space. 
(Both are equal to the dimension of the space). 

5.13* Show that all functions of the form (5.10) form a group, 
the law of combination being addition. What is the unit element ? 
Is the group Abelian? Show that viewing a vector space as a group 
in this way is a very powerful concept in setting up the mathematical 
theory of group representations. (Van der Waerden 1932, Chapter 
IL.) 


5.14 Using some of the functions of the preceding problems, 
write down some vector spaces that are invariant under the group 
32 and that have dimensions from one up to five. 

5.15 The pairs of functions x exp(—r), y exp(—r) and (x + ty) 
exp(—r) transform respectively according to the particular re- 
presentations Ij;(Z') and I’;;(7) under the group 32. What is the 
matrix P which in the manner of equation (5.15) relates the matrices 
Pj(T) and Iy(T)? Verify equation (5.15) for some of the matrices 
Ly(T) and I"y(7). 

5.16 Dy(T) and D'y(T) are the matrices of two equivalent 
representations of a group, defined by equation (5.15). Show that 
Du(T) = D'yj(T) (where the double suffix is summed on each 
side), and verify this relation using the two representations of 
problem 5.15. Verify also that if D(S) D(F) = D(C), then D’(S) 
D'(F) = D'(C), so that D’(T) is really a representation of the group. 

5.17 The vector space (y,%2%3, X,Y_%3, 1,X,y,) transforms under 
the group P, according to the representation D. Express in your 
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own words what is meant by the equation D=I' + .%. Hint: the 
functions 2,74, — %y_t, and —4/(1/3)(x,%y3 + 2Yy2%3 — 2YXo%z) 
transform according to I’. See also problem 5.6. 

5.18 Show that the vector space (a, xy, x®y?, zy3, y*) is reduc- 
ible under the group 32 by finding a one-dimensional subspace in it. 

5.19 & is the vector space of equation (5.19). Show that it is 
not true to say that any function of R is equal to some function of 
R or some function of RX! or. . . ete. 

5.20 Show that the permutation group P, has two one-dimen- 
sional representations, the symmetric one in which both elements 
are represented by -++1, and the antisymmetric in which the elements 
(12) and (21) are represented by -++-1 and —1 respectively. Write 
down some functions that transform according to these representa- 
tions, and prove that no other irreducible representations of ), exist. 

5.21 Show that the identity transformation EF and the inversion 
(3.11) form a group which is isomorphic with P, and hence that the 
symmetric and antisymmetric representations of problem 5.20 are 
the only irreducible ones. In the present case the irreducible 
representations are called the even and odd ones, and corresponding 
functions are said to have even or odd parity. 

5.22 By considering functions of the form (x + iy)” exp(—r), 
deduce six different one-dimensional representations of the point- 
group 6 (§16). This group consists of rotations by 27r/6 radians, 
r = 0, 1, 2, 3, 4, 5. 

5.23 Show that the vector space [zexp(—r), y exp(—r), 
z exp(—r)] is invariant under the full rotation group, and that it 
transforms according to an irreducible representation of the group. 
Hint: if the space were reducible, it would contain an invariant 
vector. Show that this is impossible by considering rotations of 90° 
and 180° about the x-, y-, z-axes. 


6. Application to Quantum Mechanics 


In the last section, we introduced the concepts of representations 
and vector spaces, and we shall now establish what their relevance 
is to quantum mechanics. We have already seen in §§3 and 5 
(e.g. equations (3.7) to (3.9)) that the eigenfunctions of a Hamil- 
tonian belonging to one energy level are transformed into one 
another by symmetry transformations of the Hamiltonian, and this 
suggests that different invariant vector spaces are associated with 
different energy levels. We shall now use the concepts and results 
of the last section to embody this type of argument in three precise 
theorems. 
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THrorEM 1. If a Hamiltonian is invariant under a group & of 
symmetry transformations, then the eigenfunctions belonging to one 
energy level form a basis for a representation of ®. 

Let qi,7 = 1, 2,... be the co-ordinates in which the Hamiltonian 
# is expressed, and let (q;) be an eigenfunction belonging to the 


energy level L. 
Hq) = EYh(n). (6.1) 
If T is any symmetry transformation of ©, then 
TH =H. (5.4) 
Also let 


Tyg) = $'(9). 
Then operating with 7 on both sides of equation (6.1) we obtain 


| HY (qi) = Ee (ui); | 


so that J'(q) is an eigenfunction belonging to the same energy 
level # as %6(q;). Similarly, any eigenfunction belonging to £ is trans- 
formed into another eigenfunction by the transformations of 6. 
Also if 4, and y, are eigenfunctions belonging to H, then so is 
C14, + Cyit.. Hence all eigenfunctions belonging to one energy 
level form a vector space which is invariant under © and which 
therefore forms a basis for a representation of ©. This proves the 
theorem. 

The importance of this theorem lies in the fact that we can label 
and describe an energy level and its eigenfunctions simply by naming 
the representation associated with it. This clearly does not tell us 

everything we may wish to know about the eigenfunctions such as 
detailed numerical tabulations, but it does indicate their symmetry 
properties which is frequently all that is of interest in establishing 
selection rules for transitions and other qualitative behaviour. For 
example, consider the Hamiltonian for a lithium atom with three 
electrons, including the spin dependent terms which we need not 
write down in detail. Since all electrons are alike, this Hamiltonian 
is invariant under the group P, of permutations (4.22) of the electron 
co-ordinates. The group P, was shown in § 4 to be isomorphic with 
the point-group 32, and therefore any representation of the group 
32 is automatically also a representation of P;. Thus P,; has the 
three different (i.e. non-equivalent) irreducible representations I’, 
JF and ¥# of Table 3 (cf. problem 5.6). From Theorem 1 the eigen- 
functions associated with one energy level form an invariant vector 
space under the group P,, and this vector space can be reduced 
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into subspaces each of which transforms according to one of the 
representations I’, %, ... A wave function transforming according 
to the representation ./ is antisymmetric in the usual quantum 
mechanical sense, ie. it changes sign if we interchange the co- 
ordinates of any two electrons (cf. problem 6.1). Now it is known 
experimentally that these antisymmetric states corresponding to 
the representations »/ are the only ones ever found in nature, so 
that this sorting out of the wave functions according to the irreducible 
representations of the group P, is very important. 

As another example consider the Hamiltonian (3.2) of a free 
atom. This is invariant under the group of transformations con- 
sisting of the identity transformation and the space-inversion II 
(3.11). This group has only two one-dimensional irreducible re- 
presentations, such that the corresponding function is multiplied by 
+1 or —1 by the inversion transformation, in which case the 
function is described as having even or odd parity respectively 
(ef. problem 5.21). As in the preceding example, the eigenfunctions 
associated with each energy level of an atom can be reduced until 
each eigenfunction has either even or odd parity. This, for instance, 
leads to the selection rule for optieal transitions that the initial 
and final states must have opposite parity. Similarly, we shall see 
later that angular momentum quantum numbers L, S, J, mz, ms, 
m, are simply labels for the irreducible representations of the 
rotation group. 

Corollary to Theorem 1. If @ Hamiltonian is invariant under a 
group & of transformations, then eigenfunctions of the Hamiltonian 
transforming according to one irreducible representation of & belong 
to the same energy level. 

From Theorem 1, the vector space of eigenfunctions belonging to 
one level is either irreducible or can be reduced into subspaces 
each of which transforms according to an irreducible representa- 
tion of &. It thus never happens that eigenfunctions belonging to 
the same irreducible vector space belong to different energy levels, 
which proves the corollary. 

As a trivial example, consider the 2p wave functions of a hydrogen 
atom 


te = af(r), dy =f (r), de = f(r), (6.2) 


where f(r) is some function of r (Schiff 1955, p. 85). The Hamil- 
tonian of a hydrogen atom is invariant under all rotations, and it 
can be shown that the functions (6.2) transform according to an 
irreducible representation of the rotation group (§ 8 or problem 
5.23). They therefore belong to the same energy level. 
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The corollary tells us that eigenfunctions belonging to the same 
irreducible vector space necessarily belong to the same energy 
level. The question naturally arises, do eigenfunctions belonging to 
different irreducible vector spaces always belong to different energy 
levels? In general the answer is that they do not have to belong to 
different energy levels. However, if we find several irreducible vector 
spaces to be associated with each energy level in a systematic 
way, there must be some symmetry property that produces this 
degeneracy. Hence, if we have included all possible symmetry 
transformations in the group ©, we would expect different irreduc- 
ible vector spaces of eigenfunctions to have different energy, simply 
on the grounds that there is no symmetry property remaining to 
make them have the same energy. This conclusion is borne out by 
experience, though occasionally it has not been easy to discover 
all the symmetry transformations of a Hamiltonian, as already 
mentioned in § 4. Nevertheless, a few accidental degeneracies may 
still remain. For instance in a magnetic field two energy levels 
corresponding to different irreducible representations can cross one 
another as the field is varied, so that for a particular value of the 
field they are degenerate. Accidental degeneracies also occur 
among the energy levels in a crystal (problem 26.8). We can 
systematize our conclusion by defining an accidental degeneracy to 
be a degeneracy which is not brought about in the manner of 
Theorem 1 by any symmetry property of the Hamiltonian. With 
this definition we then have: 


THEOREM 2. If the group © includes all possible symmetry trans- 
formations of the Hamiltonian, then the eigenfunctions of each energy 
level transform irreducibly under G, apart from accidental degeneracy. 

Proof: the eigenfunctions of one irreducible vector space are 
certainly degenerate by the corollary to Theorem 1. Furthermore 
they are transformed purely among themselves by 6, and are not 
linked with any outside eigenfunction by any symmetry trans- 
formation. Thus by definition any remaining degeneracy is 
accidental. 


The effect of perturbations 


It is frequently convenient in quantum mechanics when discus- 
sing a complicated Hamiltonian #, to split it up 


into a relatively simple part #, and a perturbation #,. Then the 
energy levels and eigenfunctions of #, can be studied in detail, and 
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hence the effect on them of adding #, to the Hamiltonian can be 
calculated. This is particularly useful if it is possible to choose 
H#, and #y so as to make the perturbation of the eigenfunctions 
and energy levels relatively small. Now #, being simpler than #7 
usually means that it has a higher degree of symmetry. For instance 
it is easier to calculate the energy levels of an electron in a spheri- 
cally symmetrical potential than in a potential varying arbitrarily 
in any direction. If #, has a higher degree of symmetry than #, 
it implies in general that the energy levels of #, are more degenerate 
because there are more symmetry transformations to make more 
eigenfunctions have the same energy in the manner of Theorem 1. 
Thus #, tends to produce a splitting of these levels. 


THEOREM 3. In the notation of (6.3), if #, #, and Hp are all 
invariant under a group © of symmetry transformations, and if 
the eigenfunctions of an energy level of #, transform according to 
the representation D = DY + D@® +... + D™ where the D™ are 
the irreducible components of D, then the greatest splitting that the 
perturbation Ay can cause is into n levels. The eigenfunctions of 
each of these split levels transform according to a sum of the D™ such 
that each D™ is associated with one of the split levels. 

Consider the eigenfunctions and energy levels of the Hamiltonian 


H =H + «Hy 


where ¢ is varied continuously from 0 to 1. It may be possible to 
achieve this variation physically such as by the reduction of a 
magnetic field to zero, or it may be a purely mathematical device. 
For arbitrary « 4 0 consider an energy level H,. The eigenfunctions 
of this level transform according to some irreducible representation 
D, or according to some reducible representation D™ + D+... 
if G does not contain all the symmetry elements of #. As « is 
varied continuously, the energy levels and eigenfunctions vary 
continuously, and the representations D‘ etc. cannot make a 
discontinuous change to some different (i.e. non-equivalent) re- 
presentation. Thus as ¢ is varied continuously to zero, the only 
thing that can happen is that several energies coalesce into one, 
with the representation D™ +- D@® + ...+ D™ of the composite 
level corresponding exactly to the components D™, D!®), D™ ... 
D) which have coalesced. Looking at it the other way round 
now, we can say that the degenerate level is split by % into a 
maximum of 7 levels associated with the n irreducible components 
D™ to D™, This proves the theorem. By Theorem 2, if the group 
© includes all the symmetry transformations of # (but not #4), 
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the splitting would. be the maximum allowed, except for possibly 
some accidental degeneracies of the types already mentioned, It 
should also be noted that the result of the theorem is precise and 
does not depend on %p being small. 

As an example, consider a free hydrogen atom with the electron 
in a 2p level, or any atom or ion with a single electron in a p level 
outside full quantum shells (Schiff 1955, p. 277). The Hamiltonian 
for the electron is 

2 
HAH, = Oa V2 — eVir), 

where V(r) is the spherically symmetrical potential due to the 
proton or the closed shell ion core respectively. The p eigenfunctions 
have the form (6.2). If now the whole atom or ion is placed in an 
electric potential V;,(r) with trigonal symmetry corresponding to 
the point-group 32, we have to include in the Hamiltonian # for 
the electron the term 


Hp = —€V3,lF). 


If the atom or ion is in a crystal, this potential could be due to the 
surrounding atoms (cf. problem 4.6). Thus %, #, and # =H, 
+ Hy are all invariant under the point group 32. We have already 
seen above that in the free state in the absence of #p, the three 
eigenfunctions (6.2) all have the same energy. However, they 
transform according to the reducible representation 4 = I+ ¥ 
(5.16) of the group 32, so that by Theorem 3 we may expect Hy to 
split the triply degenerate level of #, into a non-degenerate level 
(#) and a doubly degenerate level (I"). We shall see in § 10 that the 
wave functions of a many electron atom or ion in a P state (Schiff 
1955, p. 287) transform under rotations in exactly the same way 
as the functions 2, y, z. Thus they also transform according to the 
representation 4 of the group 32, and when the atom or ion is 
placed in a trigonal crystalline electric field, we may expect the P 
level to be split as before into one singly and one doubly degenerate 
level. The (28 + 1)-fold electron spin degeneracy and the effects of 
spin-orbit coupling are superposed on this whole picture. 


Summary 


If a Hamiltonian # is invariant under a group © of transforma- 
tions, then the eigenfunctions belonging to one energy level # 
form the basis of a representation D of ©. This representation can 
be used to characterize the level. If G includes all symmetry trans- 
formations of #, then D is irreducible, apart from accidental 
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degeneracy. If #, and #, are both invariant under © where Hp 
is a perturbation on #5, then #p can split the energy level Ey of 
H#, according to the number of irreducible components in the 
corresponding representation D. 

These two theorems therefore substantiate the claims (i) and (ii) 
of § 1 for the uses of symmetry properties, and it remains to apply 
the method to particular systems by studying their symmetry 
transformations and the corresponding representations. It is con- 
venient to defer until § 13 the third use of symmetry properties 
mentioned in § 1, namely the calculation of matrix elements and 
selection rules. 


PROBLEMS 


6.1 A three electron wave function % transforms according to 
the representation ./ (Table 3) of the group P3 (cf. problem 5.6). 
Show in detail that it is antisymmetric in the usual quantum 
mechanical sense, i.e. it changes sign if we interchange the co- 
ordinates of two electrons or make a permutation corresponding to 
an odd number of such interchanges, and remains invariant under a 
permutation corresponding to an even number of such inter- 
changes. 

6.2 Verify that the five d wave functions 


2(a + ty)ef(r), (w + ty)*f(r), v/(2/3)(82? ~ r*)f(r) (6.4) 


span an invariant vector space under proper and improper rotations 
about the z-, y- and z-axes. Note that this is not immediately 
obvious from the fact that they are all of the second degree in 2, 
y, z, because there is a sixth such linear combination x? + y? + 2’. 
This transforms according to a different representation under 
rotations, and therefore does not get mixed in with the d functions. 

6.3 From problem (6.2) and equations (5.20), (5.21) show that 
the five-fold degenerate level of an ion in a D state (neglecting 
spin degeneracy) splits into one singly and two doubly degenerate 
levels when the ion is placed in a crystalline electric field with 
trigonal point-group symmetry 32. This is the case of the Cutt 
ion in copper fluorsilicate CuSiF,.6H,O (Bleaney and Stevens 
1953). 


Chapter IT 
THE QUANTUM THEORY OF A 
FREE ATOM 


The purpose of this chapter is to illustrate the use of group 
theory in quantum mechanics by giving a brief sketch of the quan- 
tum theory of a free atom or ion. In §§7, 8, and 9 we shall study 
the particular groups of symmetry transformations that we shall 
need, and the mechanics of handling their representations. In 
§§10, 11 and 12 we shall use the theorems of §6 to classify the energy 
levels and their wave functions, and to discuss the splitting of the 
levels by various perturbations. In §13 we shall develop the third 
use of group theory mentioned in §1, namely the calculation of 
matrix elements and selection rules. In fact we shall derive rigor- 
ously the usual features of what is known as the vector model of 
an atom (Pauling and Goudsmit 1930), and we shall lay the founda- 
tions of a complete treatment so that any energy level, transition 
probability, etc., can in principle be calculated. For references for 
this whole chapter, the reader is referred to the list of general 
references preceding the bibliography at the end of the book. 


7. Some Simple Groups and Representations 


Cyclic groups 
The simplest type of group is the cyclic group Cn of order n 
whose elements are (H, A, A?, A3,... A®-1) where A™ = FE. Let 


Dij(At) be a representation of €,. Dy(A) can always be reduced 
to the diagonal matrix 


A = diag [A,,4,,...] = fA, . 
Ag 


: 


by an equivalence transformation (5.15) (Van der Waerden 1932, 
p. 26; Margenau and Murphy 1943, p. 316). Since Ar must always 
be represented by [D(A)},, the same transformation also transforms 
D(A‘) into Ar = diag [A\", A", . . . ]. Hence the representation D 
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has been reduced to a sum of one-dimensional representations, 
and we conclude all irreducible representations of €» are one-dimen- 
sional. Also since A” = H, we must have A,” = 1, whence 


I 
Am = exp as (7.1) 


Thus €, has » different irreducible representations. An example of 
the group €, is the group of n rotations about one axis by 360/n 
degrees and multiples thereof. The permutation group , of order 
two is a cyclic group of order two, and so is the point-group I con- 
sisting of the inversion (2, y:, 21) = (—X1, —Y+%, —Z,) and the 
identity transformation, and likewise the point-group m consisting 
of a reflection like (3.12) and the identity transformation. From 
(7.1) each of these has two one-dimensional irreducible representa- 
tions in which the identity element is represented by +1 and the 
other element by +1. These representations are called the sym- 
metric or even one and antisymmetric or odd one respectively. 


Abelian groups 


As mentioned in § 4, an Abelian group is one in which any two 
elements commute. A cyclic group is automatically Abelian, but 
all Abelian groups are not isomorphic with one of the cyclic groups. 
For instance the translation group of a three-dimensional crystal 
lattice is not. However, it is shown in appendix E that all of a 
group of commuting matrices can be simultaneously reduced to 
diagonal form, so that all the irreducible representations of an Abelian 
group are one-dimensional. 


The axial rotation group 


The rotations (2.2) by all angles ¢ about a fixed axis form the 
axial rotation group. This is Abelian, since a rotation by ¢, followed 
by a rotation by ¢, is the same as a rotation by ¢, followed by one 
by ¢,, as long as they are about the same axis. Thus the irreducible 
representations are all one-dimensional. Let (4) represent the 
rotation ¢ in one of these irreducible representations. Then we 


must have 
x(pa)x(F2) = x(b1 + $2) (7.2) 
with x(27) = x(0). The solution of this equation is 


x(¢) oa efms, m= 0, +1, +2, pia od (7.3) 
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and it will be convenient to refer to this as the m‘® or the exp(tmd) 
representation, the latter notation being preferable when the sign 
of the exponent is important. Thus there are an infinite number 
of one-dimensional representations, and every representation is 
reducible into a sum of these. 


The permutation group Pn 


This group is not Abelian for n > 2 and the theory of its re- 
presentations is not simple. We shall only prove here that for any n 
it always has two one-dimensional representations, namely, the 
symmetric and the antisymmetric ones. These two representations 
are the only ones in which we shall be interested, and all other 
irreducible representations have dimensions greater than one. 

Consider two rows of digits (Fig. 4a) and the effect on the diagram 


t+ 2 3 #4 § 

» | P| dl 
i 2@ 3 #4 5 
2 .3 4 $6 
eS | ll 
2 t+ 3 4 = °5 


2 5 | 3 4 | 


Fic. 4. Odd and even permutations. 


of interchanging pairs of neighbouring digits in the lower row. 
Interchanging the digits 1 and 2, we obtain Fig. 4b, and thus 
introduce one intersection into the diagram of lines joining the 
same digits in the two rows. Further interchanges introduce further 
intersections, but the effect of one interchange is always to add an 
intersection or to remove one. For instance, in the diagram Fig. 4c, 
the interchange of 3 and 4 would add one intersection and the 
interchange of 5 and 1 would remove one. Hence starting with the 
permutation (12345) in the lower row, an odd (or even) number of 
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consecutive interchanges results in a permutation giving a diagram 
with an odd (or even) number of intersections. A given permutation 
can accordingly be classed unambiguously as odd or even. Consider 
now the permutation transformation P(ij ... ) 
(X45 Ya, 21) = (Xe, Yi, Z) 
(2p, Yo: 2a) —= (X;, Y;, Z3) 
(X3, Ys; 23) == 2. 2 © © © 


(7.4) 


of the sets of co-ordinates (xz, yx, Ze), k = 1 to n. This trans- 
formation can be obtained as above by a sequence of elementary 
transformations in which only pairs of “neighbouring” co-ordinate 
sets are interchanged. As before, the transformation can be classed 
as odd or even according as the number of elementary interchanges 
is odd or even. Now consider the product P,P, of two permutation 
transformations. If P, and P, are equivalent to sequences of n, 
and n, elementary interchanges respectively, then the product 
transformation P,P, is equivalent to n, +, elementary inter- 
changes. Thus ifn, and , are both odd, P,P, is an even permutation 
transformation. In this way we obtain the following skeleton 
multiplication table for permutations: 


(odd) x (odd) = (even), (even) x (even) = (even), 
(even) x (odd) = (odd), (odd) x (even) = (odd). (7.5) 


This table shows immediately that there exists a one-dimensional 
representation of the group P» in which each even permutation is 
represented by +1 and each odd one by —1. This is known as 
the antisymmetric representation. As always, there is also the identity 
representation, here called the symmetric representation, in which 
each group element is represented by +1. When 7 is greater than 
2, there are in addition irreducible representations with higher 
dimensions (cf. problem 5.6). 


Summary 


All the representations of Abelian groups are one-dimensional, 
the representations of the cyclic groups and the axial rotation 
group being particularly simple. The permutation group of any 
order always has the symmetric and antisymmetric representations. 


PROBLEMS 


7.1 Consider the group consisting of all rotations which are 
multiples of 27/n radians, and write down the number representing 
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a rotation by ¢ in the m'® irreducible representation. Thus obtain 
the representations of the axial rotation group by letting n tend to 
infinity. 

7.2 By putting ¢, = ¢, ¢. = ¢ + 4¢ in (7.2), obtain a differen- 
tial equation for x(¢), and hence show that (7.3) gives the only 
single-valued solutions of (7.2). 

7.3 Classify all the permutation transformations of the groups 
Pp, and P, as odd or even, and verify the skeleton multiplication 
table (7.5). 

7.4 Show that the point-group 222 (§ 16) is Abelian. Obtain all 
its irreducible representations by showing that each element can 
only be represented by +1. 


8. The Irreducible Representations of the Full Rotation 
Group 


The infinitesimal rotation operators 


Before proceeding to derive the irreducible representations of the 
full rotation group, it is convenient to express all rotations in 
terms of three operators Iz, Jy, Jz known as infinitesimal rotation 
operators. We can then confine our attention in subsequent work 
simply to these three operators instead of having to deal with 
arbitrary rotations about arbitrary axes. Consider a rotation 
transformation R(«, §) by an angle « about the axis §. From it 
we can define an infinitesimal rotation operator Ig given by 


Lt R(a, &) —] 


a—>0 a 


| Ria, 6) ~1+iele whena <1, | (8.2) 


where we have written 1 for the identity transformation H. Strictly 
speaking iJ, should be called the infinitesimal rotation operator, 
but it is more convenient and is becoming customary to work with 
Ig. R(x, §) can be expressed in terms of J, for an arbitrary angle 
a which is not necessarily small, for a rotation by « is equal to n 
successive rotations by a/n. Thus from (8.2), 


= ils, (8.1) 


or 


oO n 
R(a, &) = Lt (1 +i2%| 


(tag)? 


q 3 
=} ia +S + (vals) 


3! 


+... (8.3a) 
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This series can be summed in a purely formal way, and we can write 
R(a«, §) = exp(tal,). (8.3b) 


However, this should only be regarded as an abbreviated notation 
for the series (8.3a) and in any particular case the exponential has 
to be expanded before it can operate on a function. 

Consider now R(«, §,) when « is very small, where &, is an axis 
in the yz-plane making an angle @ with the z-axis (Fig. 5). This 


a cos @ 


Fic. 5. Change of the co-ordinate axes in @ small rotation R(«, &,) 
where &, is in the yz-plane. 


rotation is approximately equal to consecutive rotations « cos 6 
about the z-axis and « sin @ about the y-axis, correct to terms 
linear in « This can be seen from Fig. 5 which shows the effect of 
R(«, §;) on three points x, y, z at unit distances along the co-ordinate 
axes. R(a cos 0, z) displaces x and y each by « cos @, and R(a sin 6, y) 
displaces z and x each by a sin 6. The total displacement of 
x is « in a plane perpendicular to &,, and thus the effect on all three 
co-ordinate axes is equivalent to the single rotation R(«, &,) when 
ais small. We have 


R(a, &,) = R(« sin 6, y) R(a cos 8, z) + O(a?), 
and hence from (8.1), (8.3) 


If now §& is an axis making direction cosines | = sin 6 cos 4, 
m = sin @ sin ¢, » = cos 6 with the x-, y-, 2-axes, then 


Iz = I' sin 6 + I, cos 6 where I’ = I; cos ¢ + Iy sin ¢, 
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i.e. we have 


Is == lu, + mly + nt. (8.5) 


In fact infinitesimal operators add like unit vectors. This result 
corresponds to the fact that small rotations add vectorially to a 
first order of approximation, and that angular velocities in mech- 
anics add vectorially (Milne 1948, p. 148; Goldstein 1950, p. 126). 
Of course finite rotations do not add like vectors, since they do not 
even commute (equation (4.5), problem 8.1). In conclusion we 
note that with the help of (8.5) and (8.3) any rotation can be 
expressed in terms of the infinitesimal rotation operators Iz, Ty, Tz. 


Commutation relations 


Consider now the physical rotation Rot(«, €,) when « is not small, 
where &, is the axis shown in Fig. 5. This rotation can also be achieved 
by first rotating by @ about Ox which brings the &, axis parallel to 
Oz, then rotating by « about Oz, and then returning &, to its original 
position by a rotation of —@ about Oz. Thus 


Rot(«, §,) = Rot(— 6, x)Rot(«, z)Rot(#, ~). 


From § 2 a physical rotation by an angle +a is mathematically 
equivalent to a rotational transformation of co-ordinates by an 
angle —«. Hence in terms of transformations of co-ordinates, } 


-R(—o, E,) = R(0, x) R(—a, z)R(—8, 2). (8.6) 


Expressing this in terms of infinitesimal rotation operators using 
(8.3) and (8.4), we obtain 


1 — ia(Iy sin 6 + I, cos 6) + O(a?) 
= [1 + i0I, + O(6*)J[1 — tolz + O(a?))[1 — t61z + O(6?)). 
(8.7) 
Now (8.6) is valid for all « and 6, so that we may express (8.7) 


completely as a power series in @ and a and equate coefficients of 
6a, which gives 


—iy — I,lz ara LT. (8.8a) 
By symmetry, we also obtain 
—ty, = Inly — Lylz, (8.8b) 


—il, = Iylz —~— IzTy. 


+ This relation could have been written down directly, but the author 
finds it easier to visualize the composition of physical rotations rather than 
that of co-ordinate changes. 
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In terms of 
I, = Ix -+- ily, I = Ll, om. uly, (8.9) 


these commutation relations become 


(8.10) 


The irreducible representations} 


It was mentioned in § 4 that all proper rotations about all axes 
through a point form the full rotation group, and there is in fact 
no difficulty in verifying that all the group requirements are satis- 
fied (problems 8.2 and 8.3). We shall now take an arbitrary vector 
space X which is invariant under the full rotation group, and start 
reducing it into its irreducible components. Since any rotation can 
be expressed in terms of I;, ly, Iz, it is not necessary to work with 
an arbitrary rotation but only with these three operators. More 
precisely, it follows from (8.3), (8.5) that if a space is invariant and 
irreducible under Iz, Jy, Jz, then it is also invariant and irreducible 
under all rotations and vice versa. Let us first reduce R with 
respect to the axial rotation group about the z-axis, and let um 
be any vector transforming according to the m*® representation 
(7.3).t Then 

Rid, z)Um — efmonn,, 
whence from (8.1) 
letm = Mum. (8.11) 


If um transforms according to the m‘" representation of the axial 
rotation group, then I ,um, I_um belong respectively to the (m + 1)? 
and (m — 1)*® representations, for from (8.11), 8.10) 


Ie(Lm) = (1,12 + 1,)um =(m+1)1,tm (8.12) 


and similarly 
I,(I1-um) = (m aes 1)l_tUm. (8.12b) 


When & is reduced according to the axial rotation group, let j 
be the highest value of m that occurs among the irreducible 


{ This derivation follows closely that given by Van der Waerden (1932). 


{It is convenient to use u instead of ¢ for the vectors of R so as to avoid 
confusion with the angle ¢. 


5 
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components, and let u; be a vector corresponding to this value of m. 
Now since & is invariant under rotations, Ju; also belongs to R and 
from (8.12a) it has the m-value j + 1. However since j is already 
the largest m-value found in R, we must have IJ,u; =0. On the 
other hand by repeated use of J_ we can define from u; a sequence 
of vectors Um with m =j,j — 1,j — 2,... such that 


Um—1 = tml _um. (8.13) 


From (8.12b) um belongs to the eigenvalue m of Iz. am is a non- 
zero numerical constant which we shall determine later such that 
the um are all normalized. From (8.12a), I,u,,_, belongs to the 
eigenvalue m of Jz, and we shall now prove from the definition 
(8.13) that it is actually a multiple of wm. First suppose 


Tin = Cm&nyMmev (8.14) 
where Cm is an undetermined constant. Then from (8.13) 


LUny = ml Lum = aml Lum + 2amlzum 
= Ol _ Cini + May Umn 


= (Cm + 2mM)amum. (8.15) 


Thus if (8.14) is true for one value of m, then by (8.15) it is also true 
for the value m — 1. But (8.14) is true for m =j with c; = 0, so 
that by induction it is true for all m. Further, it is possible to 
calculate the value of cm, for from (8.15) 


Cm—1 == Cm + 2m. 


The solution of this difference equation with the boundary con- 
dition c; = 0 ist 
em =j(j +1) — mlm + 1). (8.16) 


Further if & is of finite dimension, the sequence of vectors um 
must end at some point. Ie. we must have some up, =O with 
Umi #O. Hence I,um =0 and from (8.16) this only happens 
when m = —j — 1 (apart from m =j already discussed). Thus 
the last of the sequence of vectors is u_;, and the number of vectors 
is 23 ++ 1. This is necessarily an integer so that j is an integer or 
half an odd integer. 

The constants «m can now be calculated so as to make all the um, 
normalized. By appendix C, lemma 2, any rotation such as R(+8, x) 


t The solution of difference equations is dealt with in most elementary 
algebra texts, e.g. Durell and Robson (1937), 
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leaves the scalar product of two arbitrary vectors u and »v, or 
wand Rv, invariant. Thus 


| u*R(6, 2)v dr = | R(—6, x)[u*R(6, xv] dr 
de | [R(—6, x)u]*{ R(—8, x) R(8, x)v] ar 
i { [R(—8, a)u}*v dr 
where dr is the volume element. This is true for any value of 6 


so that we may substitute (8.3) for R and equate coefficients of 6. 
Thus : 


| u*(I,v) dr = { (Iu)*v dr. (8.17a) 
A similar result holds for I,, so that 
i u*(L,v) dr = | (I_u)*y dr. (8.17b) 
Now from (8.13), (8.14), (8.17) we obtain 
| Re ig OT Se Wy | (Ly 41), Ar 
= Cm +4 i Um+t Um At 
= Cm(%m4i)? { UmaUms AT. 


Hence, all the uw» are simultaneously normalized if we choose 
om41 = (Cm)~1/*, and (8.13), (8.14) become 


Ttim = VUK(9 +1) — mm + 1) nas, 
= VU(j — mj +m + 1)ms, 


Lum = JUG +L) — mlm — 1m (8.18) 
= VU(j + m)(j —- m + 1) ema, 


Tum = Mum. 


The um are also orthogonal to one another because they transform 
according to different irreducible representations of the axial 
rotation group. This follows immediately from appendix C, lemma 
5, or from the fact that in the reduction of a vector space the 
different irreducible subspaces can always be made orthogonal to 
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one another (§ 5). It can also be proved very simply directly, for 
by appendix C, lemma 2, 


i Untu, dr = ) Ra, 2)[Um*u,] dr 
— pialu—m) i Um*u,, dr, 
whence 
[ tht, dr = Oifm Ay. 


From (8.18) the vector space R'(u;, u;4, . . . U_y) is invariant 
under J,, Z_ and J,, and therefore it is also invariant under all 
rotations. We now show that it is also irreducible. For, suppose 
RR) contains an invariant subspace r, then r would also be invar- 
iant under the axial rotation group about the z-axis and would 
therefore be spanned by a set of the wm. But from one um the 
operators I,, J_ generate all the other ones, so that r can only be 
equal to the whole space R')), Thus the vectors Um,m =j,j —1,... 
—j, transforming according to (8.18), form the basis of an irreducible 
representation of the full rotation group. The different irreducible 
representations D')) are given by the allowed values of j,j = 9, 1/2, 1, 
3/2,2,..., and have dimensions 2} + 1. If a space R transforms 
according to DS), we shall refer to the particular set of base vectors 
satisfying (8.18) as the standard base vectors. 

In this subsection we started with an arbitrary invariant vector 
space R, and set about reducing it. This led naturally to the above 
description and definition of the irreducible representations. 
But it should be noted that we have here in addition a systematic 
way of actually picking out an irreducible component from %&, 
starting with a vector u; with the highest m-value. Having picked 
out one irreducible subspace, we can then orthogonalize all remaining 
vectors to it and start the process again in the remaining vector 
space. Thus % is gradually completely reduced. This scheme is 
described more fully in the next section where we actually employ it. 


Examples 


The spherical harmonics can be defined in various ways, but 
usually arise in quantum mechanics as the solutions of the equation 


1 @, ,2@ 1 a 
(5 a5 * 9x5 + aarsags)Y —>Y (8.19) 


belonging to the eigenvalue A = —i(l + 1) (Schiff 1955, p. 70). 
Here 0, ¢ are spherical polar co-ordinates (problem 3.2). The 
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spherical harmonics Yj; are particular solutions of (8.19) having 


the form 
| Yim = NimPi\™\(cos 6)etm?, | (8.20) 


where Nj» is a numerical normalizing factor, P;!™! an associated 
Legendre polynomial, and m has the 21 +1 integral values 1, 
1—1,...-—1l. The operator in (8.19) is invariant under rotations 
since it is just the angular part of the Laplacian V2, and hence the 
Yim for given J span an invariant vector space Rl) (§ 6, Theorem 1). 
From (8.20) Yim belongs to the m*® representation of the axial 
rotation group about the z-axis; the Yj; are therefore orthogonal 
to one another and R™ has the dimension 21 +1. Now from one 
of them, Yy say, we can define using (8.18) 21 +1 vectors Y'im. 
These span an invariant subspace of R™ which must be equal to 
the whole space because it has the same dimension 21 +1. Thus 
the spherical harmonics Yim transform according to the irreducible 
representation D” of the full rotation group. Moreover since Yim 
belongs to the mt representation of the axial rotation group, 
the Yim transform exactly like (8.18) if we give them the correct 
phase factors (Condon and Shortley 1935, p. 52). r'Y1m can also 
be expressed as a polynomial of degree 1 in 2, y, z. For instance 


rY,, = N(x + ty), rYi.9 = —Nv/2z, 
(8.21) 


rY,_, = —N(x — iy), N = (8/87)1/2: 


1 V 2,2 = Myy/6 (x + ty), r?¥, = —M V6 (2 + iy)z, 
r?Vo,.9 = M432? — 1°), 7?V 9,1 = M36 (x — iy)z, 
r?Y, 9 = MhV6 (x — iy)?, M = (5/4m))2, 


These functions therefore transform according to the irreducible 
representation D” and D), and the signs have actually been 
chosen to make them standard base vectors. Thus x, y, z also 
transform according to D“) but not as standard base vectors. 

An interesting feature of the representations D‘ when j is half 
an odd integer is that they are double-valued, by which we mean 
the following. In §7 we deduced the irreducible representations 
(7.3) of the axial rotation group, and concluded that m must be an 
integer because of the condition x(27) = x(0) and because (0) 
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corresponding to no rotation at all must be equal to one. However, 
for the standard base vectors um (8.11), (8.18) transforming accord- 
ing to D‘?), we have 
R(d, 2)tm = exp(imd)im 
where m is half an odd integer if 7 is. In particular 
R(27, z)um = —Um, 


so that a rotation of 27 about the z-axis makes all the um change 
sign. Since R(2z, z) is physically the same as no rotation at all, 
we have that the identity transformation is represented by two 
matrices, the unit matrix H and also —H#. Similarly by compound- 
ing any rotation R with the identity transformation, R is represented 
by the two matrices D(R) and —D?(R). This leads to no 
difficulties in quantum mechanics because the wave functions ¢ 
and —7 always represent quantum mechanically the same physical 
state of a system, so that we can consider the matrices + D)(R) as 
inducing the same transformation among the base vectors. Clearly 
the wm cannot be ordinary single-valued functions of x, y, z because 


R(2zn, z)f (x, y, 2) =f (x, y, 2). 


In fact we shall see later that the representations D‘/) where j is 
half an odd integer only arise in connection with spin functions. 

We now show how we can calculate in principle the matrix 
D‘?(R) which represents any given rotation R in the representation 
D®, If R = R(«, &) is a rotation by « about the axis &, the matrix 
D‘9(R) is completely determined by (8.18), (8.9), (8.5) and (8.3). 
When discussing such a general rotation it is usually convenient 
not to work in terms of the angle « and the direction cosines of the 
E axis, but to express the rotation parametrically in terms of three 
Eulerian angles ¢, 6, x defined in Fig. 6. Any rotation can be 
considered as shifting the z-axis to OP making polar angles ¢, 9, 
plus a rotation by x about this axis. Thus if the physical rotation 
Rot(«, &) corresponds to the Eulerian angles (¢, 9, x), we have 
from Fig. 6 

Rot(«, &) = Rot(¢, z)Rot(#, y)Rot(x, z). 

Note that OP is not the axis &; it is the direction into which Oz is 
rotated by R(a, &). In § 2 it was seen that a physical rotation by 
an angle a is mathematically equivalent to a rotational transforma- 
tion of co-ordinates by —a. Hence 


R(—a, §) = R(—¢, 2) R(—9, y) R(—x, 2), 
Ria, é) aes [R(—«, €))-? 
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Fic. 6. The Eulerian angles x, 6, ¢. 


We can now calculate the matrix representing R(«, &) in terms of 
the three parameters ¢, 0, x. As a definite example let us consider 
the representation D“/2, From (8.18) 


ed eg 1fl . 
ne[} naff ned ig} oes 


lf. 1 Lis 2 a 
ae t= 5; 1 nasi 1 


and I,* = Iy? = I,? =} 1, 


where we have written D\/2)(J ,) ete. as J, for short, and 1 for the 
unit matrix #H. From (8.3) the matrix representing R(¢, z) is 


D12)(g, 2) = 1 + > (ee 


neven nodd 


= 1 cos #4 + 21, sin 34 
ete 
a eee, 


D/2(6, y) = 1 cos 46 + 21,7 sin $6 


cos $9 sin $0 
zs He sin $8 cos at 


Similarly 
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Hence from (8.22), 


ek, cos $0 sin $0] [et? 
(1/2) = 
D'/?)(a, 8) | : | cs sin 40 cos | | : al 
eftx+4) cog 40 efx) gin 40 


—e~titx—4) sin 40 e~ Ex+4) cos 40 ° (8.24) 


The + has been included because j = 4. It can in fact be seen 
explicitly that adding 27 to y or ¢ changes the sign of the matrix. 


Relationship to angular momentum 


The infinitesimal rotation operators can be expressed directly 
in terms of co-ordinates. If f is a function of several sets of co- 
ordinates %n, Yn, Zn, then from (2.2) and using the notation of (5.3), 


R(a, 2)f (n; Ynys Zn) 
== f (tn COS « — Yn SIN a, Zy SIN & + Yn COS a, Zn) 


| r) a 
= f (Xn, Yn; @n) + & > (< a — Yn i) + O(a), 


whence from (8.1) 


Q- a 
Iz orm = —t > (2 5 a x): (8.25) 


Here we have written J, orp because as we shall see in §11, Zz can also 
operate on the spin co-ordinate of an electron. Until we introduce 
spin co-ordinates, I, and Iz orp can be considered as identical. In 
spherical polar co-ordinates 


<a 
ES > Tok (8.26) 


Now the quantum mechanical operator for the z-component of the 
orbital angular momentum L is (Schiff 1955, p. 74) 


ae 3 b2 (= 5 si ie oe i=) = —ih ie a,’ (827) 


whence from (8.25), (8.26) 
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This result is a special case of the general relation due to Dirac 


[> =m, | =, (8.29) 


for the momentum operator p canonically conjugate to the co- 
ordinate g. The relation is proved in appendix F. The most common 
example of this relation is the usual linear momentum 


ho 
Pa 7 Ox’ 

the ¢ coming in through the definition (8.1) of an infinitesimal operator. 
Equation (8.29) is very important in discussing the angular momen- 
tum due to the electron spin. In the case of the momentum due to 
orbital motion, we have the classical expression Lz = xpy — ypz 
from which (8.27) is derived, so that we have verified (8.28). How- 
ever there is no classical analogue to spin angular momentum, and 
the only satisfactory way of defining it is via (8.29) (cf. Schiff 1955, 
p. 142; Dirac 1958, p. 142). Thus anticipating a little, we can 
define the total angular momentum vector J by 


Jg=Nle, %Jy=hly, Je=—File, (8.30) 


where Iz, Iy, Iz apply to all orbital (or spatial) co-ordinates xn, 
Yn, Zn and all spin co-ordinates (§ 11). 

If we are transforming a function of many co-ordinates zy, Yn, 
Zn, we can consider rotational transformations Ry(«, &) of the co- 
ordinates 1, Yn, 2n, keeping all the other xm, Ym, 2m fixed. These 
are not symmetry transformations of (3.2). Then 


R(a, &) = R,(a, )Rp(a, &)... Ra(«,§)... (8.31) 


Correspondingly we can define infinitesimal transformations I,, 
and Ig, orp depending on whether we include or exclude the spin 
co-ordinate in the rotation. Then from (8.31), (8.2), 


| I, = 2 Jew q, orb — 2 Ten orb: | (8.32) 


The latter decomposition is clearly seen in (8.25), (8.26). Analo- 
gously to (8.28), (8.30) we define the angular momentum vectors 
In(len, lyn, len) and jn(Jan, Jyns jen) of the nih particle by 


len = KI zn orp, Jan = flan, etc. (8.33) 
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These operators should not be confused with the quantum numbers 
Land j. From (8.32) we also have 


Let ym'S) be a wave function transforming according to D‘/) 
under rotations. Then from (8.18), 


(Lz? + I? = Le \bm'P = (32 0_ oT 411, + I )m'? 


=55 + Wm. nee. 


Thus 9) describes an eigenstate of the total angular momentum 
‘with the eigenvalue +/[j(j-+1)]h and with z-component mi. 
Similarly if a function transforms according to D under rotational 
transformations of the orbital co-ordinates alone (leaving spin 
co-ordinates fixed), then it corresponds to a state with orbital 
angular momentum 4/(I(l + 1)]h. 


Further references 


A more detailed discussion of infinitesimal rotations, Eulerian 
angles, the representation D‘/2) and its relation to the Cayley- 
Klein parameters is given by Goldstein (1950, Chapter 4) from a 
somewhat different point of view. As regards the irreducible 
representations of the rotation group, there are also three main 
different approaches as exemplified by Van der Waerden (1932), 
Weyl (1931), Wigner (1931), Murnaghan (1938), Boerner (1955). 


Summary 


The irreducible representations of the full rotation group are 
called D‘?) where j is an integer or half an odd integer. D is of 
dimension 2j + 1, and the base vectors are conventionally chosen 
to transform according to (8.18) and labelled by m =j,j —1,..., 
—j. A wave function transforming according to D describes a 
state with total angular momentum 4/[j(j + 1)]h. 


PROBLEMS 


8.1 Start with a book in some definite position and apply 
successively the rotations Rot(90°, x) and Rot(90°, y) about two 
perpendicular axes Ox, Oy. Show that the final position depends 
on the order of applying the rotations, but that in either case it 
does not correspond to a single rotation by 90/2 degrees about 
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an axis bisecting Ox, Oy as would be expected if the rotations 
added like two vectors. 

8.2 Prove that there always exists a unique single rotation 
Rot(y, C) whose effect is the same as that of two consecutive given 
rotations Rot(«, A) and Rot(8, B). Hint: suppose Rot(«, A) and 
Rot(8, B) finally displace the are PQ on the unit sphere to P’Q’ 
(Fig. 7). KC, LC are the right bisectors of PP’, QQ’. Angle PCQ = 
ZP'CQ’ and 7 PCP’ = £QCQ' = y. 


Fic. 7. Composition of rotations. 


8.3 Using the result of problem 8.2, verify that all rotational 
transformations of co-ordinates satisfy the group postulates of § 4. 

8.4 Let r be the position vector of a point P on a rigid body. 
If the body is rotated about the origin, show that r is carried into 
r’ = T.r, where T is the dyadic tensor, 


vv + (E — wv) cosé+v A Esin@, 


@ is the angle of rotation, v a unit vector along the axis of rotation, 
and E the unit tensor (Milne 1948, p. 36; Zachariasen 1945, p. 242). 
Hence show that Rot(47, x) Rot(4v, y) = Rot($7, &) where & is 
equally inclined to the positive z-, y- and z-axes. What single 
rotation is Rot(47, y) Rot(47, x) equivalent to ? 

8.5 Draw a figure from which the truth of equation (8.5) may be 
derived directly in the same way as (8.4) follows from Fig. 5. Also 
prove (8.5) analytically using tensors of the form given in problem 8.4. 

8.6 Calculate the coefficients of 6? « on both sides of equation 
(8.7) and show they are equal. 

8.7 Starting with J, J_, I, expressed in Cartesian co-ordinates 
(cf. equation (8.25)), express them in spherical polar co-ordinates 
using the relations between the two co-ordinate systems (problem 3.2). 

8.8 Verify directly that I,, I_, Iz expressed in Cartesian co- 
ordinates satisfy the commutation relations (cf. equation (8.25)). 
Also verify that the matrices (8.23) do. 
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8.9 Express J,, I_, Iz, in Cartesian co-ordinates like (8.25) 
and hence verify that the functions (8.21) transform as standard 
base vectors (8.18) according to the irreducible representation 
D™ of the rotation group. In particular verify that I,(x + iy) 
== I_(~ — ty) = 0, and also (8.35). 

8.10 From the definition (8.1) show that 


Iguv = (Igu)ju + u(T gv), 


so that J, behaves like a single differentiation as regards its operation 
on a product function wv. 

8.11 The vectors u,1/2, U_y/_ (abbreviated to u,, u_) transform 
as standard base vectors (8.18) according to the representation 
D\/) of the rotation group. Show from problem 8.10 that the 
vectors 

ul; +myj-m 


~ [G+ mG — my 


transform as standard base vectors according to the irreducible 
representation D‘), 

8.12 Apply the method used to obtain (8.24) to calculate 
D6, x) referred to standard base vectors. Check the result by 
writing down the matrix representing the effect of R(@, x) on the 
functions x, y, z, and then transforming it to standard base vectors 
using (5.15). 

8.13 Show that (8.24) is a unitary matrix with determinant 
+1, and conversely that any 2 x 2 unitary matrix can be written 
in the form (8.24). 

8.14 Express the operator of (8.35) in spherical polar co-ordinates. 
What relation does it bear to the Laplacian V2? Hence verify 
(8.35) for the spherical harmonics. 

8.15 Consider 


a b —2 y+tix| |a*® —b = — 2’ y’ + 12’ 
— b* a*| |y — ix 2 b* al |y’ — ia’ z' : 


where Up = E - is unitary with determinant -+1, and where 


um 


“, y, z are real. Prove: (i) 2’, y’, z’ are real, (ii) 2’? + y’2 + 2? 
= 2? + y? + 2%, (iii) (U,U,)* = U,*U0,*. Hence show there is a 
homomorphism (appendix B) between the matrices Up and the 
rotational transformations R of co-ordinates from (x, y, z) to 
( 2, y’, z'). 
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8.16 In (8.1), note that iJ, is a real operator, in the sense that 
il, p is real if y is real. Hence take the complex conjugate of (8.18), 
and show that the functions %,* transform under rotations in 
exactly the same way as the functions (—1)/-™u_m. 

8.17* Discuss the relationship between the full rotation group 
and the group of unitary 2 x 2 matrices with determinant +1. 
Show how this can be used to derive the irreducible representations 
of the full rotation group (Van der Waerden 1932, p. 57). 

8.18 Derive a general expression for the elements D,,'!'(x, 0, ¢) 
of the matrix representing a rotation by Eulerian angles x, @, 
¢ (8.22) in the representation D‘J), Particularize the answer to 
the rotation R(6, x) and compare with the result of problem 8.12. 
Hint: use problem 8.11 and equation (8.24). 


9. Reduction of the Product Representation DY x DU 


Product representations 


In quantum mechanics it frequently happens that we wish to 
multiply wave functions or other functions by one another. For 
instance it may be convenient to express a two-electron wave 
function ¥(r,, r.) in terms of products ¥,(r,)~2(r,) of two one-electron 
functions ¥,, ¥,. Similarly when calculating a quantum mechanical 
matrix element [y;*xy; dr we multiply three functions together. 
Let us therefore consider the vectors Us, U1, Up, U1, Ug and 2, 
Vo, V_, transforming respectively as standard base vectors according 
to the representations D‘?) and D of the rotation group. Then 
we can form the 5 x 3 = 15 different products wmv,, and these 
transform into one another under rotations so that they form the 
basis of a representation of the rotation group. This representation 
is written symbolically as D' « D®. 


Reduction of a product representation 


The vector space of dimension 15 spanned by the u,v, is called 
a product space. It and the representation D‘) x D® can now 
be reduced using the method of the last section which we may 
summarize as follows: 

(i) Reduce the space according to the axial rotation group about 
the z-axis, so that each base vector belongs to a representation 
exp(tM¢) for some value of M. 

(ii) Pick out the vector (or one of the vectors) Uy with the largest 
value of M, this value being J. 

(iii) Using the operator ZJ_ and (8.18), define the vectors U,, 
Uj;.,,-..U_, transforming according to DY). 
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(iv) Orthogonalize all other vectors to the Uy,. This can be done 
without combining functions with different M since such functions 
are automatically orthogonal. 

(v) Repeat the process (ii) to (iv) in the ee space remaining 
until the whole space has been reduced. At each stage the vector 
space remaining is still invariant under all rotations by appendix C, 
lemma 3. 


In the present case the space (..., wmv,, . . .) is already reduced 
according to the axial rotation group about the z-axis. For 
Rp, Z)UmY, = (eu) (eFv,,) = ef™+ Hou ny, (9.1a) 


so that the base vector u,,v, belongs to the value 


Thus all the base vectors with their values of V written underneath 
them are: 


Ugr, Ugo Ugd_y UY, UyzU% Uyv-y 
M = 3 
Ug? Ug UpV_y  UqVy Ug UV, 
M= 1 0 —1 0 —1 —2 
Wie, = Ugo UgVi1 
M = —1 —2 —3 


The vector U, = u,v, has the largest value of M namely 3, and from 
it using (8.18) a vector U, with M = 2 may be constructed. This 
will not be uv, or u,v, but a linear combination of them. At the 
present we are not interested in just what the correct linear com- 
bination is. The important fact is that whatever the linear com- 
bination, we obtain in step (iv) one other linearly independent 
vector, U’, say, orthogonal to U, because the value M = 2 occurs 
twice in our table. Similar considerations apply for the other 
values of M. Thus we can tick off on the list once each M = 3, 
2, 1, 0, —1, —2 corresponding to a set of vectors U y transforming 
according to D‘), There remains (step (iv) above) a set of vectors 
U'y with the values M = 2, 1, 1, 0, 0, —1, —1, —2. Similarly 
from among these we can construct a set transforming according 
to D@ and then a set according to D“), which uses up all the 15 
linearly independent vectors. Thus 


D® x DM = Dis + D@ 4 Ya, 


Similarly if two sets of vectors transform according to D‘?) and 
DS), the product space transforming according to DY « Dit) 
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contains the value M=+(j+ 7’) once, M=+(j+j’—1) 
twice, M = +(j7 +7’ — 2) three times, etc. The values M where 
—|j-—j'| < M < |j —j'| are contained 2) +1 times each if 
j <j or 2j’ + 1 times if j’ < j. Proceeding in the same way as 
above, base vectors can be found transforming according to the 
representations DY) withJ =j +j’,j+j’—1,..., |j —j’|,and 
thus 


D® x DY) == DU+i) 4 DG+H-V 4 DG+- 9 4. 4 Dis", 


(9.2) 
For example 
D® x DA — Hts/2) 4. Hrs 
D® x DY x DY = DM x (De 4 Ya + Yio) 
= D® + DO@ 4+ Dw + De® i+ Dm 
+ DO + DM, (9.3) 


The new base vectors 


In simple cases there is no difficulty in carrying out the procedure 
of steps (i) to (v) above in complete detail and obtaining the actual 
base vectors transforming according to the different irreducible 
components DY). For instance if u,, wp, u_, and v,, vo, v_, each 
transform according to D), the product representation is D“ 
x DY = DM + DO + DPD, There is only one vector U,‘® 
= Uv, with M = 2. From problem 8.10 and equation (8.18), we 
can define the next vector U,'®) by operating on U,'®) with I_: 


I_U,!) = (I_u,)v, + u%4(I_2,) 
= 21/2(ugv, + U2) 
= 2U, by definition (8.18). 


In this way we obtain the following base vectors U y) transform- 
ing according to D'®, D™ and D0. 


U9 = New 
0,9 = (N/21/2)(ugvy + UX) 
Uy? = (Nq/61/2)(u_,0, + 2upvp + uv_1) (9.4a) 


U_,'?) = (N,/21/2)(u_,vy + uv_y) 

Uv.” = Nyu_v_; 

Cy) = (N,/21/2)(ugry — 4%) 

Ug = (N,/21/?)(u_yvy — uyv_s) (9.4b) 
U_ = (N,/2¥?)(u_yv — Upv-1) 


Us? = (Ng/31/?)(u_vy + UyV-4 — Up). (9.4c) 
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Here N,, N,, No are normalizing constants. 
If (a, y,, 2) and (a, Ya, Z,) are the components of two ordinary 
vectorst r, and r,, we can from (8.21) put 


Uy =X + yy, Uy = —2'/%2%,, Uy = —(%, — ), 


; ; 9.5 
Vy = y+ Ya, Ug = —21/¥2q, v_y = —(%_ — 1p). ve) 


The component (9.4c) transforming according to D,'® then 
becomes —(24/3/3)(24,%_ + Y1Ye + %2%_), Which is proportional 
to the scalar product r,.r,. The components Uy‘! become 
— (%4%_ — 21) + U(Yy%_ — Yor) and —ty/2(azyy_ — %y,), so that 
from (8.21) the quantities 


(Y1%2 — Yor), (%%q — 22), (%1Y2 — LeYj) 


transform under rotations in the same way as the components 
(a, y, 2) of an ordinary vector. As expected, they are the components 
of the vector product r, A r,. Consider now the components 7'y 
of a second rank tensor. By definition they transform in the same 
way as the products ryi23 where ru = 2, ¥1, 2 for + = I, 2, 3, etc. 
Hence the nine 7; form a vector space of dimension nine trans- 
forming according to DY x D® = DM + DM + D©, The DY 
component is the scalar 7, + 7's, + 733, and the quantities 
T x3 — 132, T31 — T13, Tig — To, transform like an ordinary vector 
according to D™ (cf. Milne 1948, p. 46). The remaining five linearly 
independent combinations 7,3, + Ty, Ts +713, Ti+ Ta, 
T11 — Tex, To. — T33 transform according to D‘), and form a 
symmetric second rank tensor with a zero sum of diagonal elements. 

It is sometimes convenient to have the relations (9.4) expressed 
the other way round. We can solve them as simultaneous equations 
for the umv,, and obtain 


Ug = (1/24/9)(U HN, + Uy /N)), 


gy = (1/3*/2)(24/2U g/g — Uo\/No), (9.6) 
ug. = (1/24/2)(U_/N, — U_0/N,), ete. 


Wigner coefficients 
The coefficients occurring in (9.4) and (9.6) are particular examples 
of certain general coefficients known as Wigner coefficients. Let 
Um: —j S ™ < 3, and Um’, —j' < m’ < ce 


{ We shall refer to the common type of vector in three-dimensional space, 
such as the position vector r of a particle, as an ordinary vector, to distinguish 
it from the more general concept of a vector in a more general vector space. 
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be two sets of standard base vectors transforming respectively 
like (8.18) according to D') and D'S), Then by the procedure of 
steps (i) to (v) above, we can find in the product space unum several 
sets of vectors 


Uy = Ny > (jj'mm' |I Mun ry (9.7) 
m, m’ 


transforming according to DY), where from (9.2) J =j +9’,j +j’—1, 
...|g—Jj'|. The coefficients (jj’mm’|JM) are the Wigner 
coefficients, also known as Clebsch-Gordon or vector coupling 
coefficients. Now our method of constructing Uy,“ is unique 
and, therefore, apart from the factor N;, the Wigner coefficients 
as defined are uniquely determined. This is the most important 
fact about them: the coefficients do not depend on the detailed 
nature of the Um and vm which may be complicated many-electron 
wave functions. For instance we already showed from (9.1) that 


(ij’mm’|JM) = 0 unless M =m +m’. 


Incidentally if we had chosen base vectors uy, vs, Uz not trans- 
forming in the standard way (8.18) but with some other definite 
transformation properties, the coefficients relating them would 
still be uniquely determined, apart from the N,, by these trans- 
formation properties. The coefficients would not depend on the 
particular form of the functions, but just on their transformation 
properties. For instance, in the example (9.4) we might have used 
L4, Yr» 215 Xo, Yor Zq a8 base vectors instead of the standard linear 
combinations (9.5), and this would have given somewhat different 
but still uniquely determined coefficients. 
As before, the equations (9.7) can be inverted to give 


tint = (ji'mm’ |TM\(LN)U ae. (9.8) 


Actually it is not obvious that the coefficients in (9.8) are the same 
as those in (9.7), but we shall prove in § 20 that with appropriate 
choice of Ny this always is so: compare for instance (9.4) and (9.6). 
We shall also derive a general formula for (jj’mm’|JM) in § 20, 
but it is too cumbersome for general use so that numerical tables 
of Wigner coefficients are given in appendix I. 


Summary 


If two sets of vectors wm and ty transform according to D')) 
and D'S), then the product space spanned by the vectors tmm 


6 
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transforms according to a representation DS) x DS which is 
reducible into the components DY given by (9.2). If all base 
vectors are chosen in the standard way according to (8.18), the. 
vectors transforming according to the different D) are given in 
terms of the umvm by the Wigner coefficients. 


PROBLEMS 


9.1 From (9.2) write down the irreducible components of 
DY x DO, DO y DU, Hay yx Day Ya yx Daw yx Har, 

9.2 Derive all the vectors (9.4) and (9.6) in detail, and verify 
that the same coefficients are involved whichever way round ((9.7) 
or (9.8)) one writes the relationship between them. 

9.3 Find the vectors Uy) in the space umv’m where the um 
and V4/9, V_sg transform as standard base vectors according to 
D® and D@/?) respectively. 

9.4 What happens to the vectors (9.4), (9.5) if we put 2, y, 
2 = Xa, Yo, 2? Show that this situation is an example of the follow- 
ing general theorem, and construct another illustration. Theorem: 
if a set of linearly independent vectors ¢; and another set of linearly 
dependent vectors yy; (i.e. linearly dependent among themselves, 
not linearly dependent on the ¢;) transform with the same matrices 
under a group of transformations, and if the ¢; transform according 
to a reducible representation D@ + D®) 4+ DM4 ... 4+ D”, 
then the %; transform according to a representation > D‘, where 
the D‘r) are some of the irreducible components out of the set 
D®, D'®, D™, ... D”, For a proof of this theorem, see Van der 
Waerden 1932, p. 74. 

9.5 Show that the Wigner coefficients (jj’mm'|JM) = 1 when 
M=m+m=4+J=4+(94+ 7). 

9.6 Using the operators J,, J_, Iz, show that > (—1)™u_mvm is 


m 
invariant under rotations, where Um, Um are two sets of functions 
each transforming according to D?. Note that (9.4c) is a special 
case of this. 
9.7 Show algebraically that 


gy 
©, OF +1) = Bi + VQ" +0). 
IF , 
What is the significance of this result in connection with equations 
(9.2) and (9.7)? 

9.8 Tyis asymmetric second rank tensor (7'4y = T'4) such that the 
sum of the diagonal elements is zero. Show that the components trans- 
form under rotations aecording to the irreducible representation D™). 
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10. Quantum Mechanics of a Free Atom; 
Orbital Degeneracy 


The total Hamiltonian # of a free atom can conveniently be 
divided into three parts 


KH =How + H spin +H mei (10.1) 


where 


Here Ze is the charge of the nucleus, and the number of electrons 
which is not equal to Z if the atom is ionized. # spin depends on the 
electron spin and will be discussed in detail in the next section. 
HA nyo, takes into account the motion of the nucleus, its finite size 
and the deviation of the potential from a pure Coulomb field near 
and inside the nucleus, and also its spin, magnetic moment, quadru- 
pole moment, etc. It is small so that we shall neglect it completely 
in this chapter, but return to consider it in § 21. 

Now even the simplest part of the Hamiltonian, # orp (10.2), 
does not have eigenfunctions that can be expressed exactly in a 
convenient closed form, though with present day computing mach- 
inery it is possible to get numerical solutions with a high degree of 
accuracy. We shall therefore of necessity be concerned with approxi- 
mation procedures and perturbation theory. In particular the wave 
functions we derive will usually be only the first terms in a complete 
expansion of the wave function, but for many purposes this is 
quite adequate. However, it is important to realize that this does 
not mean the whole theory will be approximate. The theorems of 
§ 6 allow us to make precise statements about the transformation 
properties of wave functions, even though these wave functions are 
unknown, very complicated, many-electron functions. Consequently 
anything that depends purely on transformation properties can 
be discussed exactly. Examples of such things are the degeneracy 
of energy levels, qualitative splittings by perturbations, the intro- 
duction of spin angular momentum, selection rules for transitions 
between various levels, and the relative strengths of a group of 
spectral lines. In this context the wave functions we shall write 
down are just aids to the imagination. On the other hand, if one 
wants to calculate such things as the actual energies of certain 
levels or the absolute intensity of a spectral line, one does require a 
knowledge of the wave function. In this connection, therefore, we 
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shall need to write down approximate wave functions and to have 
systematic ways of obtaining more accurate ones when required. 

In short the main features of our group-theoretical discussion 
in the remainder of this chapter are the following: (i) the rigorous 
arguments depending on symmetry properties are clearly separated 
from the approximations inherent in any particular wave function; 
(ii) in particular the treatment of spin angular momentum is made 
rigorous; (iii) the argument applies generally to an atom with any 
number of electrons. 


Self-consistent field 

We shall now start to consider (10.1) in the usual quantum 
mechanical manner, first the largest terms of #,,,, then the next 
smaller ones as a perturbation, and then still smaller ones in a 
definite sequence. As a first approximation we regard each electron 
moving independently in the average potential, considered as fixed, 
of the other electrons. Such independent motion corresponds 
to a single product wave function y% of one-electron orbitals ¢4;; 


= $1(1) pelle)... Pn(En)- (10.3) 
We now use the variational principle to calculate the ¢; that makes 


y% come closest to the correct wave function. This principlet 
states that the lower the energy E of the wave function 


E= Sb om bdr 

jebdr 

the closer ¥ is to the true eigenfunction of #,,. The lowest energy 

obtainable with y having the form (10.3) can be calculated, and it is 

found that the ¢; are given by the set of » Hartree equations 
_ (Schiff 1955, p. 284) 


(10.4) 


hi? 
|- w vet raro| bel) = Exiled), (10.5a) 
Fer 2 
Vie =— + 5 | stele any (10.80) 


As anticipated, the potential energy V; is the expectation value 
(10.5b) of the electrostatic potential energy due to the other electrons 
and the nucleus. It is called the self-consistent field because through 
it each ¢; depends on every ¢,, so that we have to obtain simul- 
taneously a self-consistent set of solutions ¢; of all the equations. 


t This is a very rough formulation of the variational principle, but it suffices 
for the present purposes (Schiff 1955, p. 171). 


QUANTUM THEORY OF A FREE ATOM 15 


Vi(re) as defined by (10.5b) is not actually spherically symmetrical, 
but for practical purposes of computation it is made so by averaging 
over all directions for fixed r;, and we shall assume that this has 
been done. (10.5) is then invariant under rotations so that we 
obtain degenerate sets of eigenfunctions transforming according 
to D0). These in fact have the form (Schiff 1955, p. 69) 


| P(t) = fng,(14) Yizm,,(1, %), | (10.6) 


where the spherical harmonic Yim_ shows that they transform 
according to D4), If the radial function f has ny —  — 1 nodes in 
it, it is designated by the principal quantum number m and the 
orbital angular momentum quantum number l; The latter derives 
its name from the fact that from (8.35), ¢; has angular momentum 
Vik + 1)}t. Instead of using J, an orbital is usually specified 
using the spectroscopic notation by a symbol nz, where n is the 
principal quantum number and 2 is the letter 


8,9, d4,f,9,h,...forl =0, 1, 2.3,4,5,... (10.7) 


A state of the form (10.3) is specified by a symbol such as (18)?2s(2p)? 
where the indices denote the number of 1s etc. orbitals occurring. 
In general the orbital eigenvalues H; of (10.5) in order of increasing 
energy are Is, 2s, 2p, 3s, 3p, 3d, 48,... , the ls orbital having the 
lowest energy because it lies closest to the nucleus. Thus to ob- 
tain the lowest energy for (10.3), we would expect to take each 
of the ¢; as a ls orbital. However as we shall see in §12, the ex- 
clusion principle prevents the atom from collapsing in this way. 

The spectroscopic notation does not specify an orbital or a set of 
orbitals (10.3) completely, for it leaves each of the m; arbitrary. For 
instance a symbol such as (1s)?2s(2p)? corresponds to a whole set of 


(21, + 1)(2lp +1)... (2 +1)... (10.8) 


wave functions (10.3) given by all possible combinations of the m,;. 
Since the orbitals (10.6) are degenerate, these wave functions all 
have the same energy (10.4). They are known collectively as a 
configuration, and this name is sometimes also applied to their 
energy. All the wave functions (10.3) of one configuration are 
eigenfunctions of a central self-consistent field Hamiltonian 


h2 
H osot = — om 2.¥4 ++ 2.Valrs). (10.9) 


They all belong to the same eigenvalue of ogc, 08 follows from the 
fact that (10.9) is invariant under separate rotations of the individual 
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electron co-ordinates x4, yi, 2; alone. All the wave functions can be 
generated from the one with every my = i by using the operators 
(I_): orp Of (8.32) and the relation (8.18). 


Energy levels of # orp 


Since the configuration of wave functions (10.3) are degenerate 
eigenfunctions of # ogee, we obtain the energy levels of # oy by 
applying 

Hog =H ory — # cact (10.10) 


as a perturbation. This represents the difference between the true 
electrostatic interaction between the electrons and the self-consistent 
field. It will produce a splitting of each configuration, because the 
total Hamiltonian #,,, is now no longer invariant under separate 
rotations of the co-ordinates 2, yi, % alone, so that the degeneracy 
(10.8) no longer applies. However, #,,, is invariant under simul- 
taneous rotations of all the co-ordinates 2;, yz, z. Under these 
rotations the wave functions (10.3) transform according to the 
reducible representation 


DY) x D® x... x Di) => DM, (10.11) 


so that by Theorem 3 of §6 the configuration is split into a set of 
levels according to the different values of Z occurring in (10.11). 
For example the configuration (1s)?2s2p3p would give levels with 
L = 2,1 and 0. 

Using the Wigner coefficients (9.7), we can write down the actual 
linear combinations transforming according to D™, We first 
combine the ¢,(r,) and ¢,(r,) to form the functions 


> (hlgmym, | L,M») Pn imylt 1) Pnalamy(F2) 
mums 
transforming according to D2) where Dt) x Dt) = ¥ Dia) 
(9.2). These are then combined with the ¢,(r,) to form functions 
transforming according to Ds) where Di) x Dis) = > Dis), 
These are then combined with the ¢,(r,) etc. until we reach Dy 
= L. The wave functions then are 


rm, = d(hile.. | Le-)(Lels . . |Ls-)(Lelg.. |La-).-. 
(Lynaln » . | LM) m,(01) om,(Fe).--dmy(Fn). (10.12) 
Here we have dropped for simplicity all but the most important 


suffices. The summation is over all combinations of the m, such 
that 2m; = M,. Now, although the wave functions (10.12) have 
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the correct symmetry properties, there is no reason why they should 
be eigenfunctions of # ;», and in general, we have to write the 
correct eigenfunctions in the form 


bem (lile Pe Ln) 
= (zu, given by (10.12)) 
+ dan(z, from other configurations). (10.13) 


Here a + 1 and the other coefficients «, are usually small. The 
mixing in of wave functions from other configurations is called 
configurational interaction. Since the whole wave function has to 
transform according to D“, only terms with the same ZL can give 
contributions to (10.13). When the energy difference between 
various configurations is comparable with the splittings produced 
by #,,, then the corresponding coefficients in (10.13) are all large 
and the configurational mixing is important in calculating the 
energies of the various levels. However, their number and J values 
cannot be affected, since by Theorem 3 of §6 these depend only on 
symmetry properties. In (10.12), it happens occasionally that we 
obtain two unrelated sets of functions with the same L from one 
configuration. In this case we have to take linear combinations 
between them to obtain even approximate eigenfunctions, and 
both sets will appear in (10.13) with large coefficients. 


Summary 


The Hartree self-consistent field equation (10.5) is invariant 
under rotation of the co-ordinates x;, y;, 24, 80 that its eigenfunctions 
transform according to D) and its energy levels are (21; + 1)-fold 
degenerate. The central self-consistent field Hamiltonian Weget 
(10.9) is invariant under separate rotations of the co-ordinates of 
the different electrons, so that its energy levels have the degeneracy 
(10.8). The Hamiltonian #oyp (10.2) is invariant under simultaneous 
rotation of all electron co-ordinates, its eigenfunctions transform 
according to D and its energy levels are (2Z + 1)-fold degenerate 
where L is given by (10.11). 


PROBLEMS 


10.1 What values of Z could the following configurations give 
rise to: (1s)?2p3s3d, (1s)?(2s)2(2p)?, (1s)?(28)?2p3p, (1s)?(2s)?(2p)8. 

10.2 Using the Wigner coefficients of (9.4), write down the wave 
function (10.12) transforming according to D(® in the configuration 


(18)?(2s)?(2p)?. 
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10.3 Using problem 9.5, write down the wave function ¢,y, with 
L = M =3 in the configuration (1s)?(2s)3(2p)*. By operating with 
I_ and using (8.18), generate the other wave functions with L = 3. 

10.4 An atom is in a state with L =0. Show that it has a 
spherically symmetrical charge distribution even when configura- 
tional interaction is fully taken into account. 

10.5* Discuss the degeneracy of the states of a hydrogen atom. 
Note that the pure Coulomb 1/r nature of the potential introduces 
an extra symmetry into the Hamiltonian, which in turn is res- 
ponsible for the extra degeneracy (Fock 1935). 


11. Quantum Mechanics of a Free Atom including Spin 


The electron spin 


In the 1920’s experimental evidence accumulated which indicated 
that electrons cannot be described correctly by the Hamiltonian 
H ory (10.2), or rather its equivalent in the old quantum theory 
of the time. It was found necessary to make the following additional 
assumptions, which have no classical analogue, about the nature of 
electrons. 

I. Each electron has an intrinsic (or internal) angular momentum 
s of magnitude 4/3%. This is called its spin angular momentum, 
or spin for short. 

II. The component of s along any direction can only have the 
values +4h. It is usual to choose a definite z-axis and to describe 
the state of the electron using an additional or intrinsic co-ordinate 
az, where oz = +1 corresponds to the component s, being +4h. 
This is called the spin co-ordinate. 

III. In general the effects of spin are small compared with the 
Coulomb repulsion between electrons, or more precisely with the 
self-consistent field of (10.5). 

IV. The electron has a permanent magnetic moment (—e/mc)s. 


An electron circling in an orbit with orbital angular momentum 1 
has an effective magnetic moment (—e/2mc)l due to the purely 
classical circulation of the current. We note therefore that the ratio 
of spin magnetic moment to spin angular momentum is twice as 
large as the ratio of orbital magnetic moment to orbital angular 
momentum. We shall not discuss here the actual experiments, 
such as the Stern-Gerlach experiment and the Zeeman effect in the 
spectra of alkali atoms (see for instance Born 1933), that originally 
led Uhlenbeck and Goudsmit (1925) and others to these assumptions. 
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Suffice it to say that in the course of time the assumptions have 
been found to give a description of the nature of electrons which is 
in complete accordance with experiment apart from some relativistic 
and radiative corrections. 

Actually the assumptions I to IV above are not logically indepen- 
dent, and we shail only make the much more limited assumption : 


An electron has some internal (or intrinsic or spin) degree 
of freedom which gives tt direction-dependent properties. 
The co-ordinate, oz say, describing the internal degree 
of freedom can only have two values, oz = +1 say. (11.1) 


From this we shall derive in the present section the existence of 
the intrinsic spin angular momentum and its magnitude (items I 
and II above). Later in § 32 we shall look for a relativistic description 
of particles having such a spin momentum in ordinary space, and 
shall arrive at the Dirac equation from which the magnetic moment 
and its interactions follow (items III and IV above). Though 
logically sounder, it would be very inconvenient to defer all dis- 
cussion of spin-orbit coupling, etc., till after § 32, so that in the 
present section we shall anticipate the results III and IV above, and 
base our discussion of spin-dependent forces on them. 


Spin operators 
From our assumption, we must write a wave function 


p(x, ¥, 2, 2) (11.2) 


for one electron as depending, not only on the orbital (or positional 
or spatial) co-ordinates 2, y, z, but also on the spin co-ordinate oz. 
Let us now operate on % with a rotation R(«, —). Going back to 
our fundamental definition of what this means (§ 5), we first refer 
y% to new axes OX, OY, OZ. Now the co-ordinate oz is defined 
with respect to a definite axis Oz, and will therefore be expressed 
also in terms of a new co-ordinate oz with respect to the new axis 
OZ. We then replace X, Y, Z, oz by 2, y, 2, oz again throughout 
to obtain Rid(zx, y, z, oz). From this description, it is seen that R 
can be divided into two independent transformations 


R(a, 8) = Rorn(a, E) Rspin(«, E), (11.3) 


where Rorp expresses only 2, y, 2 in terms of the new axes and Regpin 
only oz. As in (8.2) the rotations can be expanded in powers of « 
with the infinitesimal rotation operators, and (11.3) becomes 


1 + tol, + O(a?) = (1 + tol orb + cae ye! + tal, spin + oe we 
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Equating coefficients of «, we obtain 


Tg = Ig orp + Le spin, (11.4) 


where J, orn operates only on x, y, 2 and Lg spin only on gz. 

We can now show that an electron has a spin (or intrinsic or 
internal) angular momentum. In the absence of a classical analogue 
to spin, the angular momentum has to be defined by the funda- 
mental Dirac relation 


Py =Filg. [see (8.29) or (F.2)] 


Thus the total angular momentum of an electron is given by 


je = NL, = hI, om + Ag spin = |e + 85, (€ = 2, y, 2) 


(11.5) 


where the /, are the components of the orbital momentum 


a 7) 
L, = —in(y a 75), etc. (8.27) 


as already defined in § 8, and where 


8 =Algapin (€ = 2, Y, 2) (11.6) 


are the components of an additional, non-classical, spin angular 
momentum s. The magnitude of the spin momentum will be derived 
in (11.14). Similarly for several electrons, their combined total 
angular momentum J is the sum of the combined orbital and spin 
angular momenta L and S: 


J. = Al, — hI, orb + hl, spin = DL, + Se, (11.7a) 


where LD, = hl, orb — 2 = > Rly orb> 
t 


Sp = Le spin = > 44 = > File spin. (E = %, Y, 2) 
3 t 


(11.7b) 


Alternatively J can be split up in terms of the total orbital pie spin 
angular momenta j; of the individual electrons: 


Je = hl, = Di hile = dja =D (le + Se). (§ =2, Y, 2) 
s ‘ : (11.7c) 
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Spin-orbit coupling 
We can now calculate # spin, the spin-dependent part of the total 
Hamiltonian (10.1), expressing it in terms of the above operators s;. 
Consider an electron in the field of the nucleus, the electron being 
at the position r relative to the nucleus which is considered to be 
at rest. An observer moving with the electron would see the nucleus 
moving past him with velocity —v, generating because of its motion 
a magnetic field H = (Ze/er?)(—v) A r at the electron. This inter- 
acts with the magnetic moment m = —es/mc of the electron, giving 
a contribution 
Le* 
—m-:H = mcr? I-s 
to the total energy, i. to the Hamiltonian. Here we have put 
1=mr/ v. The fact that we want the Hamiltonian expressed 
in a frame of reference in which the atom is at rest gives an additional 
factor of 4 (Thomas 1926). We also obtain corresponding terms 
for the interaction of the electrons among themselves, and have 
therefore the following contribution to the Hamiltonian (Condon 


and Shortley 1951, p. 211; Heisenberg 1926) 
Ze* ls; 
#is = 2m?c? » r3 


ene (ti — Ty) A (ve — Vy) 1 (te — 55) A Ve 
mi | age ee 


(11.8) 
This contribution to #spin is known as the spin-orbit coupling 
because it depends on the orbital motion of the electrons as well 
as the spin. We shall see in § 32 how the spin-orbit coupling for one 
electron comes out of the relativistic Dirac equation without putting 
in the value of the spin magnetic moment. In addition there is the 
spin-spin coupling which is the direct magnetostatic interaction of 
the electron magnetic moments 


2 
H 35 = a > - 81°Sy/r4y> — 3(Si-T4y)(Sy°T1s)/74y°. (11.9) 


t<j 


Comparable in magnitude with (11.8) and (11.9) is also the classical 
magnetic interaction between the electron orbits viewed as currents, 
but we shall neglect this, since it leads to no very important effects. 


Spin-dependent wave functions 
In ¢ (11.2) the variable o, is only allowed to have the discrete 
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values o, = +1, in distinction from the other variables z, Y, 2 
which vary continuously. There are three common ways of writing 
a function of a discrete variable like g,. They are all three com- 
pletely equivalent and we shall use them interchangeably according 
to convenience. Let us introduce them by examples. Let N(a) be 
the number N of children in a certain class whose age was a years 
last birthday. Then M(a) is mathematically a well defined function 
of the discrete variable a which by its definition only takes integer 
values. Our writing N(a) and thinking of it as a function of a is 
the first way of writing such a function and corresponds to the form 
(11.2) for % which we have already used. The second way consists 
of thinking of N(a) as a table of values, for instance: 


a= <6 7 8 9 10 Ib 312 
N(a) = 0 2 10 #183 6 1 0. 
This also specifies the function completely, and similarly we can 
express 7 (11.2) in terms of two components in a little table: 
Cz = +] —! 
p(x, Y, 2, Oz) = ps(2, Y, 2) p_(x, ¥Y; z). 
Here the entries %,, y_ are still functions of the continuous variables 
x, y, 2, but they are two ordinary functions independent of one 


another and of oz. Thus the second way of writing % is in com- 
ponent form 


p(x, ¥, 2, oz) = [b,(x, y, 2), p_(z, y, z)], (11.10) 


remembering that the two components refer to 0, = +1. Consider 
now the position vector r of a point in ordinary space. We write it 
as (x, y, z) in terms of three components analogously to (11.10). 
However we sometimes consider the vector r as a single entity in 
itself and then it may be written zi + yj + zk in terms of the 
unit vectors i, j, k along the three co-ordinate axes. Similarly we 


write 
(a, ¥; 2, Sz) = h(x, Yy; zu, + p_(x, Y, 2)U_, (11.1la) 


where u,, w_ are functions of oz only, defined by 


ui(+ 1) = 1, u,(— 1) =0; u—(+ 1) =0, u(— 1) =1. 


(11.12) 
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Clearly (11.1la), (11.12) is the same function defined by our table, 
and this is our third way of writing » (11.2). w, and u_ are called 
spin functions or spinors. An n-electron wave function # is usually 
written in this third way as 


p= = Pap...v(Lrs Yrs 2, - +» Uns Yns Zn)Uetpg - + > Un 
(11.11b) 
where we have to sum over all the 2” arrangements a8 . . . v of the 


n+ and — signs, and where w,(o,,) is abbreviated to u,, etc. 


Transformation properties of spin functions 
Since all our spin operators sz, ete., are defined in terms of infinitesi- 
mal rotations (11.6), before we can operate with them on a wave 
function we have to know how w,, u_ transform under rotations R. 
Now any function of oz can be expressed in terms of u,, u_. In 
particular this applies to Ry and also Ru, and Ru_. Thus we can 

always write 
Ru, = au, + bu_, 


11.13 
Ru_ = cu, + du_, ( ) 


so that u,, u_ transform into one another by rotations. They 
therefore form a basis of a representation of the rotation group, 
which can only be D?/?) or D +. D\ since it is two-dimensional. 
In the latter case uw, and u_ would be invariant under rotations and 
we would have no directional properties contrary to our assump- 
tion (11.1). Hence u,, u_ transform according to D\/), and u,, 
u_ are often distinguished by their m quantum number m, = +}. 
From this conclusion and (8.18), (8.35), we have 


8, = thu, 8u_ = —hhu_, (11.14) 


su, = 2hu,, su_ = Pu, 


so that u,, w_ correspond to states with a spin angular momentum 
of magnitude »/¢h with components +4h in the z-direction as stated 
at the beginning of the section. 


Term wave functions 

We first generalize the discussion of § 10 by including spin in 
all the wave functions. Since #esce (10.9) and Ho (10.2) do 
not depend on spin, their spin dependent eigenfunctions can be 
obtained by multiplying each of their orbital eigenfunctions by 
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any one of the 2” products u,,Ugy . - - Un, Where of... v is a set of 
+ and — signs as in (11.11b). Thus the eigenfunctions of Horp are 
yb = PLM, orb (r,, Lo, oe P'n)U Ugo ere Uyn- (11.15) 


These 2" functions are all degenerate, corresponding to the fact 
that # orp does not depend on oz; and is invariant under a rotation 
Repin ¢ (11.3) of the i“ spin co-ordinate alone. (Compare the degener- 
acy (10.8) of a configuration.) However, instead of using in (11.15) 


the simple products w,,Ugy . . . Uj, Which transform according to 
the reducible representation 
DU x Da) x... x DW) = > Ds), (11.16) 


we can first pick out the linear combinations Us, transforming 
according to the irreducible components D‘S). These can be written 
down using the Wigner coefficients as in (10.12). Then instead of 
(11.15), the eigenfunctions of #orp are written 


WL, M1, 8, Ms) = tim, orp Usmys (11.17) 


and for given Zand S they form a degenerate set of (2L + 1)(2S + 1) 
states called a term. This degeneracy corresponds to the fact that 
HA or» is invariant under a rotation of all spatial co-ordinates or of 
all spin co-ordinates simultaneously. If we combined a given 
im, orp With each of the Usy,, we would get 2" wave functions 
(11.17) which are just linear combinations of the functions (11.15). 
The reason for preferring the form (11.17) is that in fact the exclusion 
principle does not allow us to use in (11.17) for each L all values of S 
occurring in (11.16). In fact we shall see in the next section that a 
given value of Z gets combined with only one value of S in (11.16) 
and thus gives rise to only one term, not to several. In the spectro- 
scopic notation, a term is specified by a capital letter S, P, D, F, 
G, etc., corresponding to L = 0, 1, 2, 3, 4, analogously to (10.7). 
The value of S is indicated by the superscript 2S + 1. For instance 
a term with S = 1, ZL = 2 is written 3D. The superscript is pro- 
nounced “‘singlet”, “doublet”, “triplet”, etc. 


Splitting due to H spin 


What splittings occur if we now include # spin in the Hamilton- 
ian? We first have to derive the symmetry transformations of 
H spin. In §8 it was shown that if 


E=li + mj + nk 
is a unit vector along some axis 0£, then J, is given by 
I, = 1], + mI, + ndz. (11.18) 
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Thus if R is a transformation to new co-ordinate axes 0£, On, 0f 
where 


& = la + my + nz, 


R transforms Iz into I, (11.18), and we see that Iz, Iy, I, behave 
under rotations like the components of an ordinary vector. It 
follows from the definitions (8.28), (8.30), (8.33), (11.5), etc., that the 
components of the angular momenta s;, k, J, etc., all transform under 
rotations like the components of vectors, as we have anticipated by 
our notation. Now #spin contains products of vectors, for instance 
Ayes; (cf. (11.8) and (11.9)). The Hamiltonian is therefore no longer 
invariant under orbital and spin rotations separately. However, 
1,8; is invariant under a simultaneous rotation in both orbital and 
spin space, because it is the scalar product of two vectors and 
transforms according to the identity representation D, and the 
same applies to all the parts of #gpiy (11.8) and (11.9). Hence the 
eigenfunctions of Horn + H spin can be sorted out according to 
the representations D‘Y) of the combined-orbital-and-spin rotation 
group. 

In particular the term functions (11.17) transform according to 


DY) x DiS) = > DW) (11.192) 
under simultaneous orbital and spin rotations, where the values 
J=L+48,0+S8—1,...|L—S8| (11.19b) 


are given by (9.2). By Theorem 3 of § 6 we therefore expect the 
term to be split into separate levels, one for each value of J, and 
these are still (2J + 1)-fold degenerate. In the spectroscopic 
notation the value of J is indicated by a subscript, For example 
the spin-orbit coupling splits the term *D into the levels °D,, 
3D, and *D,. If we want to specify the level even more completely 
we write in front the configuration from which the term arises. 

The procedure just outlined is particularly profitable when the 
splittings produced by # spin are small compared with energy 
separations between terms. For then we obtain good approxi- 
mations to the eigenfunctions by using the Wigner coefficients to 
pick out from the term functions 4(Z, Mz, S, Ms) the linear 
combinations 


aL, S, J, M3) ae > (LSM iM 5| JIM 5) WL, M1, 8, Ms) 
aes (11.20) 


transforming according to DY). This situation is known as Russell- 
Saunders coupling and applies particularly to the light atoms. 
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However # orn + “spin is not invariant under separate orbital and 
spin rotations, so that L, Mz, S, Ms, are no longer good quantum 
numbers for characterizing a state. As regards the wave functions 
this means that the exact eigenfunctions are not just (11.20) but 
contain, in addition, contributions with the same J and M, from 
other terms with different L and S. In fact in the very heavy atoms 
(and likewise for nuclear forces), it happens that the effects of | 
H spin are larger than the splittings due to #¢eg, so that the wave 
functions are very mixed. One then obtains better zero order 
approximations to the correct eigenfunctions by following the 
scheme known as jj coupling. In the central self-consistent field 
approximation, all terms in (11.8) vanish except the ones which 
reduce to 


KH spin => > E(ra)le- Si, 
1 1dV;, (11.21) 
Qm2%c2r ar” 


where &(r) = 


This can be deduced directly from (11.8) or more simply as in 
equation (32.26) from the Dirac equation. We also neglect #55 
(11.9). With this approximation # oscr + #’spin is invariant under 
a simultaneous rotation of the space and spin co-ordinates of the 
separate electrons, and its eigenfunctions are just products 


Pimp (lis oz) Piymya(Vas Gz2) eee Pinmin ins zn): (11.22) 


Here the spin-dependent one-electron orbitals ym, transform 
according to D® x Div) => DY). The products (11.22) trans- 
form according to 


Di x Dis) > 3 DG) — > DY), 


and we can use the Wigner coefficients as in (10.12) to pick out the 
linear combinations ¥(j,, jo, . . - jn, J, Mz) transforming according 
to the DY). These will be the zero order wave functions when we 
split the configuration (11.22) by applying the remainder of the 
Hamiltonian, namely #eg + H spin — # spin, a8 a perturbation. 
There will of course be again some configurational mixing as in 
(10.13). 


Parity 


As already noted in § 3, # orp is invariant under the space inver- 
sion IT (3.11). Now JT commutes with every rotation R, TR = RIT. 
Hence if (%,, . .., on) is a vector space invariant under JT 
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and transforming irreducibly under rotations, the matrix D(IZ) 
representing JJ commutes with all the irreducible matrices D(R) 
representing the rotations. By Schur’s lemma, appendix D, we have 
D(II) = wH where E is the unit matrix. Since J]? is the identity 
transformation, we have w = 1 or w = —1 when the representation 
is single-valued, and corresponding functions % such that JT = % 
or If = — are said to have even or odd parity. As regards the 
functions u, and u_, Tu, and ITu_ must be expressible again in 
terms of u,, u_ as in (11.13), so that the vector space (u,, u_) is 
invariant under J7. However in this case it transforms according 
to the double-valued representation D“/®) under rotations. The 
identity transformation is represented by the two matrices HZ and 
— FE, so that 

ITu, = wu,, Tu, = wu_ (11.23a) 


where w = 1, —1, 7, or —t. It may be verified @ posteriors that all 
the results of physical significance which are derived using J], 
e.g. selection rules, are quite independent of which value of w is 
chosen in (11.23). Briefly the reason for this is that physically 
observable quantities contain products like ¥;*%; so that ITpb,*y; 
contains the factor (w*w)" where n is the number of electrons. 
This factor is always unity for each value of w (11.23). For simplicity 
therefore we choose 


| the spin functions w,, w_ to have even parity w = 1. | 


(11.23b): 


To determine the parity of the various wave functions we have 
written down, we first note that the spherical harmonic Yj, can 
be expressed as a polynomial of degree J in 2/r, y/r, z/r (§ 8). It 
therefore has parity (—1)?. Consequently the configurational wave 
functions (10.3) have parity 


| w = (—1) | (11.24) 


and the same applies to their linear combinations, the 4,4 , (10.12). 
Now since # orp is invariant under J7, its eigenfunctions have definite 
parity, so that the configurational mixing in (10.13) can only be 
between configurations with the same parity. When we include the 
spin functions Ugy, in (11.17), the parity is still (11.24) since we 
have already made the convention that all spin functions have 
even parity. #spin is also invariant under JJ, since all the 1; and 
S$ are (appendix F). Thus # op + #spin is invariant under JT, and 


7 
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its eigenfunctions ¥(L, S, J, Mz) (11.20) can be assigned a definite 
parity (11.24) which is determined by the configuration from which 
the level arises. This is true no matter how much mixing between 
configurations and terms there is in the wave functions. 


Summary 


The electron spin has been introduced and the property of spin 
angular momentum deduced. # orm is invariant under separate 
rotations of all orbital co-ordinates and of all spin co-ordinates, 
so that its eigenfunctions transform according to D‘ and Dts) 
under these rotations. Hence its energy levels, called terms, are 
labelled by quantum numbers L and S, and have a degeneracy of 
(2L + 1)(28 +1). Hom + Hszpin is invariant under simultaneous 
rotations of all orbital and spin co-ordinates, and we get sets of 
eigenfunctions transforming according to D) given by (11.19). 
A level characterized by J is (2J + 1)-fold degenerate. The total 
Hamiltonian is invariant under the space inversion JT, so that the 
parity given by (11.24) is also an exact quantum number. 


PROBLEMS 
11.1 Prove that szu, = }iu_, szu_ = }hu,. Show that if we 


write the wave function (11.10) as a column matrix 7" | then we 


have to write any one-electron operator as a 2 X 2 matrix. In 
particular show that the matrix for a one-electron Hamiltonian 
is obtained by multiplying the spin-independent part by the 2 x 2 
unit matrix 1 and by putting in the spin-dependent part 


lf. 1 ot. Ss 1 fl. 
oe = 54 eF oy = 5], ‘| se = 5H sat 


These matrices, without the factor 4%, are known as the Pauli spin 
matrices oz, oy, oz. Show that they have the property 


Czy = —IyVg = 102, etc., ox" = oy" = oz” — 
11.2 Express the function o,(%,u, + %_u_) in component 
1 : 
form, (i) when oz is considered as the matrix _] in the sense 


of problem 11.1, and (ii) when oz is regarded as a co-ordinate as in 
(11.1la). Hence show that the double meaning of o, can lead to 
no confusion. 

11.3 The normalized pure spin function wis such that wu = 1 when 
the € component of along the é-axis is 4%, where 0¢ makes spherical 
polar angles @, ¢ with the z-axis. Express u in terms of w,, u, 


—_? 
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11.4 It is found that an electron in a state y has equal proba- 
bilities of having its spin parallel or antiparallel to the z-axis at any 
particular point r. What form does % have, expressed in terms of 
u, and u_? 

11.5 What values of S can we have for an atom with three 
electrons? Write down the spin functions Usy, for S = $ (use 
the method of problem 10.3). 


L+s 
11.6 Show that > (2J +1) = (22 +4 1)(28 +1), and inter- 
IL—sI 


pret this with respect to the degeneracy of a term. 

11.7 Show that an S term (L = 0) does not necessarily have 
even parity. 

11.8 Starting with a particular configuration, show that one 
obtains the same J values using the Russell-Saunders coupling and 
the 77 coupling schemes. 

11.9 Show that the vectors r, and r, A r, respectively do and 
do not change sign under the space inversion J7. Such vectors are 
called regular vectors and pseudovectors respectively. Classify the 
following vectors into regular and pseudo: velocity, momentum, 
grad, angular momentum, spin, force, electric field, magnetic field, 
magnetic vector potential, magnetic moment. (Note that we define 
the vector product r, A r, to have the components y,z, — y92,, 
etc., in either a left-handed or a right-handed reference frame. 
Similarly, to obtain the transformation properties of electric and 
magnetic fields, we require that Maxwell’s equations be the same 
written for a left- or right-handed set of axes.) Show that the 
Hamiltonian for an atom is invariant under JZ when the interactions 
among the electrons and with the nucleus (§ 21) are included, and 
the interaction with a uniform magnetic field. Hence, deduce that 
parity is an exact quantum number for describing the state of an 
atom under these conditions, but that it would not be if the spin- 
orbit coupling had the form A2r;-s; (Lee and Yang 1956). Consider 
also an atom interacting with a general time-dependent electro- 
magnetic field and show that the parity of the whole system (atom, 
field and sources of the field) does not change with time (problem 
30.2). In particular verify this when the atom undergoes a radiative 
dipole transition (Schiff 1955, pp. 224, 246, 257). 


12. The Effect of the Exclusion Principle 


Antisymmetric wave functions 
So far we have not taken account of the fact that Horn + H# spin 
is invariant under the group PD, of permutations of the electron 
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co-ordinates. Let % be an eigenfunction of Hom or Horm + H spin. 
If P is any permutation of the co-ordinates rj, oz, then by §6 Px 
is also an eigenfunction with the same energy. This increases the 
degeneracies deduced in the last two sections by a factor of up to 
ni (factorial n) and we can generate an enormous number of degener- 
ate eigenfunetions. For instance, suppose we have obtained, using 
the Hartree equations, an approximate eigenfunction 


£ = diel i)Pop,m(lo)¢41U +2 


of #H orp, corresponding to the configuration 1ls2p of the helium 
atom. We have really a degenerate set of functions with the three 
allowed values of m = 1, 0, —1 and two values +4 each for m,,, 
Ms. By operating with P,,, the transformation that interchanges 
Lr Yi» 21» Fy ANd Le, Yq, 2%, Tzq, We generate a new function Pyy~ 
from each one of these, so that we have a total of 3 x 2% x 2 = 24 
degenerate functions. These can be sorted out according to the 
irreducible representations of P,. Clearly the linear combinations 


Ys = x de P pip, Y= — Pip (12.1) 


transform according to the symmetric and antisymmetric re- 
presentations respectively (§ 7). 

Now in §7 it was shown that for any number n of electrons, Pn 
always has an antisymmetric representation. The Pauli exclusion 
principle says that the only states occurring in nature are the ones 
with wave functions transforming according to these antisymmetric 
representations. This can be derived from the fact that electrons 
have spin } [see discussion above equation (32.27)]. In our example 
with two electrons, the exclusion principle eliminates half the pos- 
sible number of wave functions, namely those of the form Y, 
(12.1). When » > 2, the reduction in the number of wave functions 
is even more drastic, a fraction 1/n! being in general antisymmetric. 
Hence in making calculations for an atom with several electrons, 
it would appear to be a waste of effort to set up a large number of 
wave functions, only to reject most of them at the end because of 
the exclusion principle. Following Slater (1929), it is much better 
to restrict oneself to antisymmetric wave functions from the begin- 
ning. From any wave function ¥, we can always construct an anti- 
symmetric one ¥ analogously to (12.1), namely 


(12.2) 


» 
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Here the antisymmetrizing operator A means applying a permuta- 
tion P to %, multiplying by 8p = +1 as the permutation is even or 
odd (§ 7), summing over all P in the group Pn, and multiplying by 
the normalizing factor. When 7 is a simple product of one-electron 
functions, Y can be written as a determinant: more generally 
when # is a sum of products, ¥ is a sum of determinants. 


Configuration wave functions 


Let us now modify the discussion of §§ 10 and 11, and express it 
entirely in terms of antisymmetric wave functions. It is essential] 
to include spin from the beginning because the exclusion principle 
applies to permutations of all co-ordinates, not of the orbital ones 
alone. Thus analogously to (10.3), we write as our first approxima- 
tion to the wave function a single determinant 


PF = A g(t, )uer ball a)Uge - - - On(Pn)lvn 
Pilly)Uag Sa (Ty)U py... 
SS Te $4 (Lo) tro Pa(Le)U go et ee 


a/n} 


(12.3) 


As before «8... v is some arrangement of + signs. We first neglect 
H spin. Our ¥ (12.3) is too simple to be an eigenfunction of H orp, 
but we can define the expectation value of the energy H as in 
(10.4). We again obtain the best wave function subject to its 
having the form (12.3) by varying the dn so as to minimize Z. 
This gives a set of equations analogous to (10.5) from which the 
dn can be determined. These are the Hartree-Fock self-consistent 
field equations (Hartree 1957) 


he Z 2 2 
|- Cia ae ~ ot Z | $3*(¥4)b;(13) = ae] ils) 


2 
= > fl $y* (¥z)ba(Fy) = any| Py(Ts) = Higi(ri). —(12.4a) 
ill 

The electrostatic average potential appears as before. The last 
term on the left is the so-called exchange term which has no classical 
analogue, and in it the summation is only over electrons having a 
parallel spin direction, i.e. czy = az. A clear discussion of its physical 
significance has been given by Slater (1951). In making actual 
calculations with these equations, it is usual to average the potential 
and exchange terms over all directions if they are not already 
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spherically symmetrical. Then (12.4a) is invariant under rotations 
of the 2;, y:, 2: alone, and the wave functions have the form (10.6) 
transforming according to D), In this way the individual energies 
FE, and the total energy H of ¥ become independent of the my 
and ms; though they depend through the self-consistent field on the 
lj and the principal quantum numbers 7. It follows that the total 
energy of the wave function (12.3) for the whole atom is completely 
specified by the quantum numbers nj, ij. Such a level is called a 
configuration and has degeneracy 


2n(21, +.1)(Qly +1)... (2ln + 1). (12.4b) 


There is one important difference between the determinant VY 
(12.3) and the simple product Hartree wave function (10.3). In 
the latter there is no restriction on the quantum numbers 7, k, 
my chosen, but we notice from the determinantal form that Y 
vanishes if we put two electrons say ¢ and j into states with exactly 
the same quantum numbers 2, 1, mz, mg, i.e. if for instance ¢,(r)u, 
= ¢,(r)ug. This happens because the two columns 1 and j of the 
determinant become identical so that the determinant is zero. 
Hence applied to determinantal wave functions of the type (12.3), 
the exclusion principle takes the form that only one electron can 
occupy a particular orbital, designated by a particular set of quantum 
numbers n, l, mi, ms. This drastically limits the number of wave 
functions in a configuration. For example in the configuration 
(1s)?2s(2p)? any arrangement with m,, = mg =} OF Mg, = Meg 
= —} is excluded. Furthermore, from the antisymmetry of the 
wave function we have 


Pm, = 3; Me = —}$) = —P (mg, = —4, Meo = $), 


(where all other quantum numbers are supposed the same on both 
sides) since these two wave functions are just the same determinant 
except for an interchange of the first two columns. Thus the two 
combinations of quantum numbers do not give linearly independent 
functions. Similarly mm, =m, = 1, m,q = ms, = $ is excluded, 
but my, = 1, m5 = 0, Meg = Mg5 = § is allowed. 


The periodic table 


The exclusion principle not only limits the number of states in a 
configuration, but also limits the number of allowed configurations. 
For instance (1s)° is excluded because there are only two possible 
ls functions corresponding to m; = 0, ms = +}. Similarly there 
are two 2s functions (1 = 0), six 2p functions (J = 1), and, in general, 
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2(21 + 1) functions with principal quantum number 7 and orbital 
quantum number J. These make up respectively the 1s shell, 2 
shell, 2p shell, etc., as they are called. We can now write down the 
configurations corresponding to the states of lowest energy of the 
lightest atoms. In hydrogen the single electron goes into the state 
with lowest energy, namely 1s. In helium we have a second Is 
electron, which completes the 1s shell. In lithium the third electron 
must go into the orbital with the next lowest energy, namely 2s, 
and in this way we build up the configurations shown in Table 4. 


TABLE 4 
Configurations of the Elements 


H is Lis 1s? Qs Na 1s? 2s? 2% 38 

He 1s? Be = 182 2s? Mg 1s? 2s? 2p* 352 
B_ 1s? 2s? 2p Al 1s? 25% 2p® 352 3p 
Cc Is? 282 23 Si 1s? 2s? 2p% 352 2p2 
N 182 282 2p P 1s? 2s? 2p* 332 3p3 
O 1s? 2s? 2p 8 1s? 2s? 2p% 382 3p4 
F 1s? 28? 25" Cl 1s? 28? 2p* 332 3p 
Ne 1s? 2s? 28 A 1s? 2s? 2n® 382 3p 


eee 
We notice the similarity between the configurations of the sequences 
Li, Be,... , F, Ne and Na, Mg,...Cl, A. They differ only by the 
closed shells (2s)2(2p)* which is a very stable structure, as shown 
by the inertness of neon, compared with the more loosely bound 
additional electrons which have to go into orbitals with not nearly 
such a low energy. The latter give rise to the chemical properties 
of the elements and are called valence electrons. In this way the 
whole periodic classification of the elements can be accounted for 
(see for instance Slater 1939, p. 344). 


Enumeration of terms 


Asin § 11, all the determinantal wave functions ¥ (12.3) belonging 
to one configuration transform under orbital rotations according to 


Dw) x Dl) x 12. x Dn — > Dh (12.5a) 


and under rotations in spin space according to 


DO x DA) x... x DA) = > Ds), (12.5b) 


Using the Wigner coefficients we can reduce the vector space with 
respect to both these groups and construct a set of (2D + 1)(28 + 1) 
linear combinations Y(L, Mz, S, M s). These transform according 
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to D‘) and D'S) under orbital and spin rotations respectively, and 
one such set (or its energy) is called a term as in §11. Taking linear 
combinations of determinants keeps the wave function antisym- 
metric of course. 

Following Slater (1929) we can enumerate all the terms from a 
given configuration which are allowed by the exclusion principle. 
This is done by counting in a way similar to § 9 all the values of Mz, 
and Ms that occur, and then grouping them into complete terms. 
Applying a rotation to all orbital variables keeping the oz fixed, 
we have from (12.3), (10.6) 


Ror (¢, 2) ¥ = exp(i¢d > my) ¥ = exp(tMz¢4) ¥. 


Mr, => mu, Ms = > msi. (12.5c) 


In a configuration such as (1s)?2s(2p)?, a full shell like (1s)? never 
gives any contribution to Mz, and Ms. This is because the states in 
one shell can be grouped in pairs with quantum numbers m, mz 
and —m), —ms,, and in a full shell they are all occupied. Let us 
therefore designate a state of the configuration (1s)?2s(2p)? by 
writing down (ni, 4, mu, msi) for each of the three electrons (i.e. 
2s(2p)?) outside the closed shell. In this way we can systematically 
write down all the states allowed by the exclusion principle. They 
are shown in Table 5, where + denotes +4 and where all states 
with negative M, or Ms are omitted. These are simply given by 
symmetry from the states with both M , and Msg positive. To 
group the values of Mz, Mg into terms, we pick out the highest 
value of Mz with the highest Ms accompanying it, namely Mz 
=2, Ms =}. From the corresponding wave function, we can 
generate with I_orp, I_spin all the wave functions belonging to a 
term with L = 2, S =}, ie. a ?D term. This uses up the Mz, 
Mg values marked (a) in Table 5 and corresponding negative values. 
Continuing the process among the remaining ones, we obtain also 
the other terms given in Table 5. 

Two orbitals having the same / are said to be equivalent if they 
also have the same n. If in the above example we had taken the 
two p orbitals to be inequivalent, e.g. the configuration (1s)"2s2p3p, 
there would have been some extra allowed entries in Table 5. For 
instance the determinant (200 +)(211-+)(311 +) in the con- 
figuration 2s2p3p is non-zero, whereas the determinant (200 +) 
(211 +)(211 +) in the configuration 2s(2p)? vanishes. 

As another example consider the configuration (2p)*. Corre- 
sponding to any given set of quantum numbers, e.g. (211 +) 


Thus 
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TABLE 5 
Enumeration of Wavefunctions in the Configuration 

(1s)?28(2p)? 

M, M, Termf 
(200+ )(211 + )(211—) 2 1/2 & 
(200 + )(211+ (210+) l 3/2 b 
(200 + )(211 + )(210—) i 1/2 a 
(200 + (2114 )(21—1+) 0 3/2 b 
(200 + )(211+4 (21 —1—) 0 1/2 a 
(200 -+)(211 —)(210+) l 1/2 b 
(200+ (211 —)(21—1+) 0 1/2 b 
(200 + )(210 + )(210 —) 0 1/2 d 
(200 —)(211 + (210+) l 1/2 © 
(200 —)(211+)(21—1+) 0 1/2 c 
+ (a) 3D; L = 2,8 = 1/2. (b) *P; L = 1, S = 3/2. 

(co) *P; L = 1,8 = 1/2. (d) *S; L = 0, S = 1/2. 


(211 —)(210+)21 —1—), with total M quantum numbers 
Mia, Msa, we can write down the remaining two orbitals (2 10 — ) 
(21 —1 +) required to fill the shell. These have total quantum 
numbers Mzp = —Mzia, Msp = —Msa, because the full shell 
has M; = Ms = 0. Since in any term the values +My, and +Ms 
occur in pairs, the terms that can be formed from the configuration 
(2p)* are the same as those from (2p)*. Similarly (2p)° and 2p give 
the same terms, and so do (nd)* and (nd)}°-*, etc. 

Table 6 lists the terms from some configurations which consist 
only of equivalent electrons outside a full shell. A configuration 
composed only of closed shells has only the one state with Mz 
= Ms = 0, ie. it gives a 18 term. 

The discussion of parity in § 11 is unaffected by the exclusion 
principle, and the parity (11.24) applies to all states derived from 
one configuration. 


TABLE 6 
Terms from Simple Configurations 


8 4 28, 


s*, p* or d1° : 1s. 

p or p® ; *P. 

p?* or p* : ®P, 1D, 18. 

2 ; “8, 2D, *P 

d or d® : 2D. 

d® or d® ; SF, ap, 1q, 1p, 1g. 
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Energy levels 


As already mentioned, with some approximations in the Hartree- 
Fock equations (12.4), all the determinantal wave functions 
(12.3) belonging to one configuration have the same energy.t 
Now # orm is invariant under separate rotations of all orbital co- 
ordinates and of all spin co-ordinates. Hence the eigenfunctions can 
be grouped into terms transforming according to D and DS) 
(12.5), different terms in this approximation having different 
energies. The wave functions are approximately linear combinations 
of determinants (12.3) from the configuration to which the terms 
belong, but strictly they include also small contributions from 
other configurations. These contributions must come from terms 
with the same L and S so that the whole wave functions still trans- 
form precisely according to D and D'S). This limits the amount 
of configurational interaction considerably, and there are further 
restrictions, e.g. the configurations must have the same parity. 

For a detailed discussion of the calculation of the term energies 
we refer to Condon and Shortley (1951, p. 191). However, we can 
see qualitatively why different terms belonging to the same con- 
figuration have different energies. Terms with different L have 
different spatial arrangements of their charge density, and so 
differ in electrostatic interaction energy. But how do terms with 
the same L but different S differ in energy when # orp does not 
depend on spin co-ordinates, e.g. +P and 2P of Table 5? The answer 
is that the exclusion principle is not spin-independent; it demands 
antisymmetry in all the co-ordinates including the oz. Consider a 
determinant Y (12.3) in which two electrons i and j have parallel 
spin, i.e. msi = mej. Then for both electrons at the same point 
ry = 1; we have 

PyP (ri = rj) = Pri = 13). (12.6) 


However by the exclusion principle Py’ = —¥, so that we must 
have P(r; = rj) = 0. Le. there is zero probability of finding the two 
electrons with the same spin simultaneously at the same point. Clearly 

{ There is actually no single Hamiltonian analogous to F nace (10.9), Of 
which the Hartree-Fock determinants are the eigenfunctions. However by 
further approximating to the exchange terms in (12.4), Slater has derived a 
central field Hamiltonian of the form 


2 
# sister = — 2 Veit D> Viri) 


with determinant eigenfunctions. These approximate closely to the Hartree— 
Fock wave functions, and for one configuration all have the same energy 
eigenvalue (Slater 1951, Pratt 1952). 
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(12.6) does not hold for electrons with different spin directions, and 
for these ¥ + 0 for rz; =1;. Thus the exclusion principle auto- 
matically keeps electrons with parallel spins out of each other’s 
way, which lowers their electrostatic interaction energy. Of all 
the states in. Table 5, only the one with Ms = 3/2 has all three 
electron spins parallel, and we expect it therefore to have the lowest 
energy. Of course, the other states belonging to the same term 
must have the same low energy. More generally, in any configuration, 
the term with the lowest energy has the maximum number of parallel 
spins, i.e. it has the largest S. Subject to this condition it also has 
the largest L possible. This is Hund’s rule. The LZ rule follows by 
considering the state with maximum M,;. It consists of deter- 
minants with mainly positive values of m,;. The angular momenta 
of the electrons tend to be lined up so that the electrons are going 
round the nucleus in the same direction. This allows them to keep 
as far apart as possible on opposite sides of the nucleus as they 
revolve with a consequent lowering of their electrostatic energy. 
On the other hand electrons going round the nucleus in opposite 
directions (opposite m;) meet one another every revolution, so that 
they are close together for a considerable fraction of the time, and 
this raises the electrostatic energy. These rules only apply rigorously 
to the lowest term of a configuration. 

Asin § 11, when we include # spin in the Hamiltonian, each term 
is split according to 


DY x DiS) = > DY) (12.7) 


into levels characterized by the quantum numbers J, My; and 
parity. In light atoms the effect of spin is small compared with 
the differences between terms (Russell-Saunders coupling case). 
Using Wigner coefficients the zero order wave functions Y(L, 8S, 
J, Mj) can be written down, but the exact eigenfunctions contain 
admixtures from other terms. The level with the minimum J has 
lowest energy, unless the configuration contains a partly filled shell 
more than half full, in which case the lowest energy is given by the 
maximum value of J. Fig. 8 gives a sketch of the levels of the con- 
figuration (1s)?2s(2p), the levels from other configurations having 
too high or too low energy to be included in the diagram. 


Summary 


The exclusion principle requires all physically applicable wave 
functions to be antisymmetric with respect to permutations of the 
orbital and spin co-ordinates together. This drastically limits the 


98 GROUP THEORY IN QUANTUM MECHANICS 
Energy 2p 


is¥2s(2p)" 


ls J > 3 
= esd J s 34 
J =e 
Confiquration Terms Levels 
Self-consistent Split by Split by Split into 
field approximation electrostatic spin-orbit 2/+1 sublevels 
interaction coupling by magnetic 
field 


Fia. 8. Energy levels of the configuration (1s)* 28 (2p)?. 


number of allowed configurations and the degeneracy of each con- 
figuration. Systematic ways of deriving the allowed configurations 
and terms have been developed. 


PROBLEMS 


12.1 What is the lowest level of the Mn++ ion? What con- 
figurations could be admixed with it? The ground state configuration 
is (3d)5. 

12.2 What terms arise from the configurations (2p), (2p)?3p, 
and 2p3p4p? 

12.3. A configuration consists of two partly filled shells. Show 
how its terms may be derived by first obtaining the terms from the 
two shells separately and then combining them. 

12.4 Write down the spin functions Ugy, (11.17) for two 
electrons, and show that the S = 1 and S = 0 ones are respectively 
symmetric and antisymmetric with respect to permutations. Hence 
show that all wave functions ¥(L, Mz, S, Mg) of helium terms can 
be written as a symmetric (or antisymmetric) orbital wave function 
times an antisymmetric (or symmetric) spin function. Hence set 
up all the terms arising from low lying configurations of helium and 
draw a rough energy level diagram (Van der Waerden 1932, p. 111). 
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Discuss how an analogous (but more complicated!) procedure 
can be followed for more than two electrons (see § 28; also Wigner 
1931, p. 270). 

12.5 Illustrate the argument of equation (12.6) with a simple 
two-electron wave function. 

12.6 Write down the Hamiltonian # for an atom in a uniform 
magnetic field in the z-direction (Schiff 1955, pp. 292, 293). Show 
that # is invariant under rotations about the z-axis, and that we 
would expect a level characterized by the quantum number J to 
be split into 2J + 1 levels. Are J and M, exact quantum numbers 
in the presence of the field? 


13. Calculating Matrix Elements and Selection Rules 


Matrix elements 


In quantum mechanics all physically measurable quantities 
appear as matrix elements or are expressible in terms of them. A 
matrix element is an expression of the form+ 


My = | w*Qby dr (13.1) 


involving an operator Q and two wave functions yy, 4;. For example, 
if we are calculating energy levels, the operator is the Hamiltonian 
or some part of the Hamiltonian that is being applied as a perturba- 
tion. Also the transition probabilities between energy levels with 
the emission or absorption of radiation, and the expansions for 
wave functions, all involve matrix elements. In this section we 
shall establish selection rules which state which matrix elements 
are non-zero. We shall also show how non-zero matrix elements 
for different i and j can be related to one another and calculated. 

We may assume without loss of generality that ¥; and ¥; belong 
to an orthogonal normalized set of functions %, which transform 
in a definite way according to the irreducible representations of 
some group. That is, 


[enn dr = Omns (13.2) 


where the Kronecker 5, is defined by (A.15). We now expand 
Quy; in terms of the complete set yy: 


Qh; = 2 Mn bn. (13.3) 


+ If ds, 4; are spin-dependent functions, we shall define the symbol f... dt 
to include also a summation over the values +1 of all spin co-ordinates az, 
i.e. setting u,*ug = 5, for each electron. 
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Multiplying by ¥4* and integrating, we obtain using (13.2) that the 
expansion coefficient My is just the matrix element (13.1). Conversely 
if we have an arbitrary wave function ¢; and expand it in terms of 
a complete set of functions y¥,;, then the expansion coefficients are 
just the matrix elements [y,*¢; dr. In what follows we shall be 
constantly alternating between the two points of view (matrix 
element and expansion coefficient), according to which is more 
helpful for the purpose at hand. 


Fundamental theorem 


Instead of specifying the %» by the single suffix n, let us sort 
them out to transform according to the irreducible representations 
D™ of some group © Now D™ will occur several times, for 
example among the eigenfunctions of the hydrogen atom we have 
2p, 3p, 4p, . . . functions all transforming according to the re- 
presentation D of the rotation group, and we shall use the index 
r to distinguish the r*® set of functions transforming according to 
D, The different functions inside one such set will be distinguished 
by a subscript such as 7 or j, and we shall choose the functions 
always in some conventionally chosen standard way so that the 
particular representations D“") with different r are not only equiva- 
lent but completely identical. Instead of the Qu; of (13.3), we start 
by considering some set of functions ¢;“ transforming according 
to D™ under G, and expand them in terms of the ’s. 


bi = S>> My . (13.4) 


ryt 


Here the M’s are now the matrix elements 
My) = fun hy dr. (13.5) 


Now the right-hand side of (13.4) has to transform in the same 
way as the left-hand side, and this suggests that the only yj" 
which can appear on the right-hand side of (13.4) are those that trans- 
form according to the same row j of the same irreducible representation 
D® as 6; ttself. I.e. (13.4) reduces to 


$9 = 2 yy "7. (13.6) 


This result is indeed so plausible and well known in particular 
instances (e.g. in Fourier analysis an even function can be expanded 
in terms of cosines alone without any sine terms), that we have 
already used it implicitly in (10.13) ete. The result can be 


QUANTUM THEORY OF A FREE ATOM - 101 


re-expressed in terms of the Mj;““", to give what we shall refer to 
as the fundamental theorem of this section. 

TA. When p 4A, Mi" =0. T.e. there can be no contribution 
to (13.4) from sets of functions not transforming according to D”, 

TB. When p =A, My" =ay,oy. Ie. inside one set ,°), the 
matrix M,; is a multiple ay, of the unit matrix. 

Proof. So far the reasoning has been purely heuristic. The 
proper proof of the theorem is as follows. If 7’ is any transformation 
of G, T¢;™ can be written in two ways, 


Tb) = SD My Di) da”, 
ur ik 
and Thy = > Dy®(T) di™ 
) 


=! >> Dy™(T) My”) fy (4"), 
ur tk 


Comparing these two, we see that the matrix M;,;‘” for each yp, 
r satisfies Schur’s lemma (appendix D), from which the results 
TA and IB above follow immediately. Q.E.D. 

For most applications, we require two extensions of the theorem. 
Suppose that instead of the 4; in (13.4) and (13.6), we have linear 
combinations 


O = ¥ Py di™ 
t 


which do not transform in the standard manner adopted for the 
di” and y,“". From (13.4) and (13.6) we have 


@;") = >>> My py (er) 


ryt 
= DD Or Pay ye, 
rt 
i.e. My,” = Sy Are P 45. 


Thus the matrix elements are determined uniquely by the Py, 
apart from the factor a,,5,, which is a constant for each set of 
y's. (By a set of %’s we mean the y¥;'"), 71 = 1, 2, . . . forming a set 
of base vectors for the irreducible representation D™.) Further- 
more the Py determine the transformation properties of the ®,” 
as shown explicitly by (5.15), and conversely the transformation 
properties uniquely determine the coefficients Py as can easily 
be proved in the manner of problem D.2. We therefore have 
symbolically: 


transformation matrix elements M;;‘“") 
properties of —+> Py — < apart from constant 
the B, factor. 
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Similarly if the ~’s being used do not transform in the standard 
way, then this can be taken into account in the same manner. 
This gives the first extension of the fundamental theorem: 

ITA. If the ®; and yy'"") have not been chosen to transform in the 
standard way but in some other manner, then IA still holds. IIB. 
Also the Mi“) are completely determined as regards their dependence 
on t and j by the transformation properties of the B;” and y,'), 
except for a constant factor a,,. 

A further extension is obtained by taking linear combinations 


D, = D> Pay di 
+ yu 


of ¢’s from several irreducible representations. These are again 
expanded in terms of the ~’s; 


OD; = SDD Ma i”. (13.7) 
rept 


By using the substitution (13.6) and arguments analogous to the 
ones above, we easily deduce: 

IITA. If in (13.7) the D; on the left transform according to a reducible 
representation D@™ + Di) + ... + D®, then My =0 unless 
p=a, B,...ore. IIIB. Whenpw =a, B,... or e, then the matrix 
elements My") are completely determined as regards dependence 
on 1 and j by the transformation properties of the ®’s and W's, except 
for a constant factor a, if «, 8B... «are all different. If D™, Di? , . 
D are not all different and D') occurs among them n times, then the 


My” involve n undetermined constants Ay,’ yy, A" wy, -... The latter 
result comes about because the linear combinations ®; contain 
terms from the x sets of functions ¢;), 6's), 67), ... , so that from 


(13.6) the matrix elements involve the constants @,,, @’ yp, @ ups +++ 
When we describe these constants as “‘undetermined’’, we mean 
that they are not determined by symmetry properties alone, but 
depend on the detailed form of the functions involved. 

As stated at the beginning of this section, the most common 
application of these theorems is to derive selection rules for the 
matrix elements 


| My = f yo Ou yy de, | (13.8a) 


where we suppose ¥;”, Q,™ and y, transform respectively 
according to D™, D™ and D™, From (13.3), these matrix elements 
are obtained by expanding Q;‘“)%,") which transforms according 
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to D x D™, in terms of the 4; Theorem IIIA and B therefore 
gives: 

A. In (13.8a) My ts non-zero only if the reduction 
of D™ x D™ contains D®, 
B. If D™ is contained once (or n times), then the (13.8b) 


My are uniquely determined by the symmetry pro- 
perties of the fs and Q’s, apart from a constant 
factor (or n undetermined constants). 


In (13.8b) the representation D® must be irreducible, but D™ 
and D) need not be. Alternatively the matrix elements (13.8a) 
are given by expanding the whole integrand ¥j)*Q,;; in 
terms of any complete set of functions, the first of which is the 
constant unity which of course always transforms according to 
the identity representation. Theorem IIIA and B then gives the 
selection rule 


A. In (13.8a) My ts non-zero only if the reduction 

of D®* x D® x D” contains the identity re- 
presentation. 

B. If the identity representation is contained once _(13.8c) 
(or n times), then the My are uniquely determined by 

the symmetry properties of the {’s and Q’s, except 

for a constant factor (or n undetermined constants). 


In the latter formulation, none of the representations D”, D™, 
D need necessarily be irreducible, though usually they will be. 
An alternative proof of the important results above will be given — 
in § 14 using group characters. 


Electric-dipole transition selection rules 


An atom in one quantum state can make a transition to another 
quantum state either spontaneously with the emission of radiation, 
or under the influence of externally applied radiation. We shall 
restrict ourselves to the type of transition described as electric- 
dipole, and the spontaneous and induced transition probabilities 
are then proportional to the modulus squared of the matrix element 
(Schiff 1949, p. 247) 


<ilrald> = fet ray dr. (13.9) 
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Here 


m= —vVt > (tn + Yn), rs=vt 2 (tn — tyn), 
ro = >. 2n, (13.10) 


and yy and yy; are the final and initial states. The summations are over 
all electrons in the atom. Also « = 0 applies to radiation linearly 
polarized along the z-direction, and a =1, —1 give the proba- 
bilities for radiation circularly polarized about the z-axis. In the 
Zeeman effect (problem 13.6) the linearly and circularly polarized 
radiation are known as the 7 and o components respectively. 

It is now simple to write down selection rules stating when 
<t|r.|j> can be non-zero. In § 12 it was shown that degenerate 
eigenfunctions of an atom transform according to the representa- 
tions DY) of the rotation group. Suppose y¥:, %; belong to sets 
with J = J,, J, respectively. Then the r,, transform according to 


DY x DUY — DU) 4 DV) 4 DUeVifJ,> 1. (13.11) 
From (13.8b) above, <i|r,|j> = 0 unless J, = J, + 1, J, or Jz—1. 
However, when J, = 0, then D™ x D® = D™, 30 that only 


J, = 1 is allowed and the transition J, = 0 to J, = 0 is impossible. 
We can summarize the results by the selection rule: 


AJ =0, 1 or —1, except 0 > 0. (13.12) 


Suppose now that y;, 4; in (13.9) transform according to the rep- 
resentations M,,, M,, of the axial rotation group about the z-axis. 
The r, transform with M=1, 0, —1 for « =1, 0, —1 and the 
whole matrix element transforms with M = —M,, +«+ Myz. 
We have the selection rule from (13.8c) 


AM, = +1, circularly polarized radiation, (« = +1), 


AM, = 0, linearly polarized radiation, (a = 0). 


(13.13) 


Similarly for <é|r,|j7> to be non-zero, the initial and final levels 
must have opposite parity (11.24). 

In the case of Russell-Saunders coupling (§§ 11 and 12), we 
derived in § 12 approximate eigenfunctions ¥(L, S, J, Mz) trans- 


+ We shall not follow any more the notation of § 12 as regards the use of a 
capital Y for antisymmetric wave functions. All wave functions are now 
aasumed to be antisymmetrio. 
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forming according to D‘) and D‘*) under orbital and spin rotations. 
The r, have LT =1 and S = 0, so that analogously to (13.12) we 
have 


AL =0,1 or —1 except 0 —> 0, 


13. 
AS =0. Ee 


However, since LZ and S are not good quantum numbers, this selec- 
tion rule only distinguishes large from small matrix elements. 


Transition probabilities 

So far we have used what amounts to part A of our theorem to 
sort out the non-zero matrix elements. We can now use part B to 
calculate the relative intensities of several related transitions. 
Consider the matrix elements between two energy levels with eigen- 
functions ¥; = ¥(J = 2, M,) and ¥; = ¥(J = 1, M,). From (13.8), 
all matrix elements between these two levels are completely deter- 
mined apart from one common numerical factor by the transforma- 
tion properties of the (J = 2, M,), the r, and the ¥(J = 1, M,). 
Hence to calculate the relative values of these matrix elements, 
we can replace these functions by the functions Uy,,'?), u,, vg, of 
(9.4) which transform in exactly the same way. From (9.6) 


[ Ty ™*ugo de = V4/Ny, 

f U,'?)*ugv, dr = 0, 

J Uy! *ugvy dr = +/§/No, ete. 
whence we obtain the relative magnitudes of all the matrix elements 
for « = 0 (Table 7). Those for « = +1 may be obtained similarly. 


More generally when y:, 4; transform according to DY), DV», 


TABLE 7 
Relative Values of (J = 2, M,|ro|J = 1, M,> 


o 
ooo 
o 
= 
moo 
eco 
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the matrix elements <J,M),|r,|J_Jf7.> between a particular pair 
of levels are proportional to the Wigner coefficients . 


GM yy, |7.|F2M y2> (lJ 90M 72 |J,M 53), (13.15) 


as follows from (9.8) by the same argument. 

Although (13.15) determines the relative values of the matrix 
elements completely, we shall now calculate the ones for 4J = 0 
transitions (13.12) in an alternative way purely to illustrate the 
use of the fundamental theorem. The 4, ¥; are now two sets of 
functions 4, and ¢m, say, both transforming according to DY) 
but belonging to different energy levels. In § 11 it was shown from 
(11.18) that the infinitesimal rotation operators Iz, Iy, Jz transform 
under rotations in the same way as 2, y, z. Consequently from 
(8.21) the J,, 


I, = —U2z + tly), I, = V2lz,; IL, =1, — ty, (13.16 


transform in the same way as the r, (13.10). For purposes of cal- 
culating the matrix elements, we therefore replace $y,, Ta, Pat, 
by ¢u, J. mu, Taking 


|<IM, |r. |FM,> |? oc || bui* 1.2 47|*, 
we obtain from (8.18) the relative transition probabilities: 
a=IM+1 —->4U4M; rel. prob. = J(J + 1) — M(M +1); 


a=0,M +> M; rel. prob. = 2M?; 
a=—l,M—1—-UM; rel. prob. = J(J + 1) — M(M — 1). 


Perturbation theory 

As in § 6, let # =H +H, be a Hamiltonian consisting of a 
major or unperturbed part #, and a small perturbation #. If 
the energy levels Hor of #4 are all non-degenerate, the energy levels 
of # are (Schiff 1955, p. 153) 


r|%, KH; 
B, = Bor + r|aylry + >, ele 5 


s4r 


(18.17) 


Here we use the Dirac notation (13.9) and <r|#p|s> is a matrix 
element of % with respect to the eigenfunctions ¢,, ps of Ho. 
The three terms give the energy correct to the zeroth, first and 
second order of perturbation respectively. 
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Now Schiff 1955 (p. 155) shows that (13.17) does not apply and 
perturbation theory becomes at first sight very complicated, when 
the unperturbed energy levels of #, are degenerate as they usually 
are. The reason is that one in general gets zero energy denominators 
in (13.17). To avoid these, it is necessary to choose such linear 
combinations ¢,, $s of the eigenfunctions % of #>, that 


| br* Hobe dr = 0 when Eo = Erg. (13.18) 


Then formula (13.17) becomes correct again using matrix elements 
with respect to the ¢;. Fortunately group theory and our funda- 
mental theorem help us considerably to sort out the eigenfunctions 
of #9 such that (13.18) is satisfied. Let G& be the group of all sym- 
metry transformations of #7. H#H> is invariant under © so that 
Hy, transforms in exactly the same way as ys. Hence if we sort 
out the ¥’s according to the irreducible representations of © and 
use the indices p, s, i, etc. of (13.4), we have from results IA and IB 
that 

<Ari |p |wsj> = Sy Siz Ors: (13.19) 


Thus (13.19) is automatically satisfied unless an irreducible com- 
ponent D occurs more than once in the vector space of one degener- 
ate energy level of #4. This happens infrequently: if it does, one 
has no option but to solve an xX » secular equation, where 7 is 
the number of times D” occurs in a particular unperturbed level. 
Hence we can usually obtain the correct energy levels by applying 
the ordinary non-degenerate perturbation theory (13.17) to each 
set of states 4%”) having the same X and 1. 


Relative splittings due to spin-orbit coupling 

We shall now calculate to first order the energies of the split 
levels when we apply the spin-orbit coupling Hz s (11.8) as a per- 
turbation to the degenerate states of one term. The eigenfunctions 
of a term can be sorted out according to either the quantum numbers 
L, S, Mz, Ms or L, S, J, Mz, where L and S are fixed for one term 
and the others take on several sets of values. For the present we 
shall use the former classification. Now the angular momenta 
liz and L, transform in the same way under orbital rotations and 
spin rotations (they are invariant under the latter). Hence (13.8) 
implies that the matrix elements of I, and DL, inside one term are 
proportional: 


(LSM 1M 5|liz| LSM',_M's> oc <LSM LM 5| Lz| LSM'1 M's. 
Similarly the matrix elements of (r~¢ A v;)z and Lz are proportional; 
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so are those of sz, and Sz, and those of r; A vj:s,andL-:S. Thus 
the whole of # zs (11.8) can be written effectively as 


as regards the vector space of one term, where ¢ is a constant. 
Since J = L + S, (13.20) can also be written 


KH 15 = $C(LS)(J? — L? — S?). (13.21) 


In order to apply the perturbation (13.21) to our term wave 
functions, we first have to sort out the eigenfunctions ¥(LSJM) 
which transform according to D“) under rotations, so that (13.18) is 
satisfied. Then from (13.17) and (8.35), 


E(LSJM s) = Eo + 3(LS)JU + 1) — LL + 1) — 88 + 1), 
(13.22) 
where EH, is the energy of the term without spin-orbit splitting. 


Absolute values of spin-orbit splittings 


The fundamental theorem only establishes the relative magnitudes 
of all matrix elements between one pair of vector spaces. For 
instance in the case of the spin-orbit coupling, to calculate the 
absolute values of the splittings (13.22) we still have to determine 
the ¢(ZS). This is not really a group theoretical problem, but 
since it occurs in any application of the theory, we shall show as an 
example how the ¢(ZS) are calculated. In principle there is no diffi- - 
culty. In § 12 we have shown how to write down (neglecting con- 
figurational and inter-term mixing) the wave functions ¥(LSJM),) 
in terms of Wigner coefficients and the one-electron orbitals ¢4. 
It would only be necessary to calculate the energy of one such state 
in each term to fix ¢(LS). In practice these wave functions are very 
complicated and it is easier to proceed slightly differently. We 
calculate the value of any convenient matrix element or sum of 
matrix elements in two ways, once from (13.20) and once from the 
original Hamiltonian (11.21), 


Hsin = Dd, E(rale “8, (11.21) 


and equate them. This gives some simultaneous equations for the 
required quantities, in our case the ¢(ZS). 
To apply this method in detail, (13.20) is written 


H is = C(LS)L +S = &(LS)($L,8_ + LS, + L8z). (13.23) 
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Let us calculate its diagonal matrix elementt with respect to 
¥(LSM1,Ms). Operating on this wave function with LS,, we 
generate using (8.18) the wave function with M ‘s=Ms+]1, 
M'; = M,—1 which is orthogonal to ~(ZSM;Ms). Hence in 
(13.23) only the last term contributes to the matrix element and 


(LSM 1M 3|415|LS8SM_M 5) = ((LS)M_LM s. (13.24) 


Similarly from (11.21), (12.2), (12.3), we obtain for the matrix 
element with respect to a single determinant wave function 


Cnlimuyme | 15 |nlymiyms> = 2, End Mime, (13.25) 


Cnty = ar | Sng PEP) fagiy(rr? dr. (13.26) 
0 


Here f is the radial part of a one-electron orbital as in (10.6), so 
that (13.26) can be calculated once the Hartree-Fock solutions are 
known. We note from (13.25) that a closed shell of electrons does 
not contribute to the matrix elements of #5. 

Consider now the specific configuration 2p3p. Its terms may be 
found analogously to Table 5, and they are *D, ®P, 1D, 1P, 38, 18. 
In the *D term, the wave function ~(L=2, S=1, Mz =2, 
Ms =1) consists of a single determinant (211+)(311-+) in 
the notation of Table 5. Thus from (13.24), {13.25), we have 


20(°D) = Hay + Bap. (13.27) 


We cannot determine the other {(LS) so simply, and have to 
make use of a theorem called the sum rule. Let #;, 1 = 1 to n, be 
a set of wave functions and let ¢, = ZC,,%, be n mutually orthogonal 
linear combinations. Further let My, M’., be matrix elements 
of any operator with respect to the y,, ¢,. Then in matrix notation 
M’ =C*MC. If },, ¢, are normalized, Sutd, dr =f 9,*6,d7 = 1, 
so that O*C = E and C is unitary. Applying the result of problem 
A.7 (appendix A), we have that the sum of the diagonal matrix 
elements M’,,, 18 equal to the swm of the My. 

In our case the two wave functions with M, =1, Ms = 1 of 
the *D and *P terms are linear combinations of the two deter- 
minants (211 +)(310-+) and (210-+)(311-+). Hence cal- 
culating the diagonal matrix elements (13.24) and (13.25) and using 
the sum rule, we obtain 


£(*D) + £(°P) = bap + Bop: (13.28) 
ft The matrix element fyi*Qy; dt is said to be diagonal if x = yj. 


110 GROUP THEORY IN QUANTUM MECHANICS 


Similarly, simultaneous equations for all the ¢(LS) are derived. 
In particular from (13.27), (13.28) we have 


C(°D) = £(2P) = bop + Bop 


Hence from (13.22) the total splitting of the 3D levels is 5/3 times 
the spread of the 3P levels, a conclusion which is very amenable 
to experimental confirmation. 


Summary 


A fundamental theorem has been established, which in essence 
states the following. If some functions ¢; are expanded in terms of 
a complete set of functions y¥,, and if each set has been sorted out 
according to the irreducible representations of a group, then each 
of the terms appearing in the expansion must transform in the same 
way as the ¢; being expanded. The expansion coefficients are matrix 
elements, many of which are therefore zero. Selection rules indicate 
which matrix elements are non-zero, and their relative magnitudes 
have also been calculated. This has been applied to electric-dipole 
transitions, degenerate perturbation theory, and level splittings 
due to spin-orbit coupling. The absolute magnitudes of the matrix 
elements can only be determined by evaluating one of them, or a 
sum of several, explicitly. 


PROBLEMS 


13.1 Complete Table 7 for 7, and r_,. Hence verify that the 
total intensity radiated in all directions is the same from each of 
the initial levels J = 2, M,. Prove this result in general for transitions 
from the 2J + 1 different states of one level. The result is not 
unexpected because the various M, states only differ in the orienta- 
tion of the angular momentum, which should not affect the total 
intensity. 

13.2 In (13.9), (13.10), —e2an, —eXyn, —eX2m is the electric 
dipole moment operator. To obtain magnetic-dipole transition 
probabilities, it has to be replaced by the total magnetic moment 
(—e/2me)(L + 28). Calculate the selection rules for magnetic 
dipole transitions, and also those for electric quadrupole transitions 
(Schiff 1955, pp. 253, 292; Condon and Shortley 1951, p. 93). 

13.3 If we use an electric or magnetic field to split the levels 
of a term, show that there can be no electric dipole transitions 
between the levels of one term or even one configuration, but that 
magnetic dipole transitions can occur. In paramagnetic resonance 
at microwave frequencies the quanta of energy are so small that 
one is dealing with transitions inside one term. The samples are 
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therefore placed in a resonant cavity at a position of maximum 
Hy to induce magnetic dipole transitions. 

13.4 Show that all electric and magnetic multipole transitions 
are forbidden for the transition J = 0 to J = 0. 

13.5 Write down the Hamiltonian for an atom (not hydrogen) 
in a uniform electric field. Show that it produces no splitting of the 
levels in the first order of perturbation (Schiff 1955, p. 159). 

13.6 Show that the linear term in the Hamiltonian for an 
atom in a uniform magnetic field H along the z-axis can be written 
BH (Lz orp + 2L2 spin), where the Bohr magneton § is efi/2mc. Show 
that this is effectively (1 + «)BHI, as regards the vector space of 
eigenfunctions of one energy level. Use the relation L? = (J — S)? 
=J?+S?—J-S—S-J to calculate « in the case of Russell- 
Saunders coupling. Hence determine the energy levels for small 
fields, and express them in the usual form E, +918HM 5, where 
gx is the Landé splitting factor 


— 1 4 VAN FSS +1) — LL +1) 
“= QI(J + 1) 


~ 


This splitting is known as the anomalous Zeeman effect (Schiff 
1955, p. 294). 

13.7 Using the perturbation of problem 13.6, calculate the 
energy levels of an atom in a large field H, for which the magnetic 
energy is much greater than the splittings due to spin-orbit coupling 

_but is small compared with the separations between terms (Paschen— 
Back effect; Schiff 1955, p. 294). Sketch schematically how the 
energy levels of a *P term change as the field is reduced to zero. 

13.8 In dipole transitions, how do the intensities of the a and 
o radiation vary with direction (Schiff 1955, p. 253)? Calculate the 
relative intensities of all possible 7 and o Zeeman components 
(in a weak magnetic field in the z-direction) for the transitions 
between a *P,/, and a *P,,. state. Using the g-factors of problem 
13.6 to determine the energy differences, prepare a diagram showing 
the splitting, polarization and intensity pattern that would be 
observed for the *P3,.—>?P,/. line in the z-direction and the 
z-direction (Condon and Shortley 1951, p. 378). 

13.9 The centre of gravity of a set of levels is defined to be 
their mean energy weighted according to the degeneracy of each 
level. Using (13.24) and the sum rule, show that the centre of 
gravity of the levels J of one term is equal to the unsplit term 
energy E, of (13.22). 
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13.10 Calculate all the ¢(LS) for the 2p3p configuration. 

13.11 Consider the (nd)? configuration. 

(i) Work out the allowed terms of this configuration. 

(ii)* Express the energies of the allowed terms in terms of the 
standard radial integrals Fy(nd*), F,(nd?), F',(nd*) (Condon and 
Shortley 1951, p. 193). Find the condition that these F’s must 
satisfy if Hund’s rule is to apply. 

(iii) Find the spin-orbit coupling parameter ¢(Z8) for all terms 
in terms of Lng. 

(iv) Work out the eigenfunctions of the M; = 3, Ms = 0 states 
belonging to the 1@ and °F terms in terms of simple determinants. 
Using the tables of Wigner coefficients (appendix I), find the 
M; = 3 eigenfunctions of both the J = 3 and J = 4 levels of the 
3F term, again as linear combinations of determinant wave functions. 

13.12 An atom in a state transforming according to DW) is 
placed inside a crystal in an electric field due to the surrounding 
ions. Show that the centre of gravity of the 2J + 1 levels is not 
shifted in first order if the crystal potential V contains no constant 
term. Hint: show > ¢mu*¥y transforms according to D!; expand 


V in spherical harmonics; show f{ V > ¢u*¢u dr = 0. 

13.13 D® and D are two irreducible representations. By 
putting @ = 1 in (13.8) and comparing (13.8b) and (13.8c), prove 
that D™* x D contains the identity representation if and only 
if DY = D), 

13.14 Verify directly that (13.8b) and (13.8c) give identical 
results for the selection rule (13.12). 


Chapter ITI 
THE REPRESENTATIONS OF 
FINITE GROUPS 


In the preceding two chapters we developed and illustrated the 
use of group theory in quantum mechanics. In the remaining 
chapters we shall apply these methods to a selection of problems 
from various branches of physics. The present chapter develops 
the mechanical aspects of handling group representations in 
general, At a first reading, only pp. 113-117 and a look at pp. 
125-131 are vital. 


14. Group Characters 


Before we can use group theory in quantum mechanics, we have to 
have systematic procedures, applicable to an arbitrary group, for: 

(i) labelling and describing the irreducible representations; 

(ii) reducing a given representation; 

(iii) deriving all the different irreducible representations. 

We shall not concern ourselves very much with this last point 
apart from just indicating how it is done, because for a simple group 
the irreducible representations are almost sure to be tabulated, or 
at least deducible by inspection from those of a tabulated related 
group. Complicated groups are usually treated by special methods, 
cf. the rotation group (§ 8), the permutation group (Wigner 1931), 
and the Lorentz group (§ 31). 

In §§ 8 and 9 we developed a method for doing the three things 
listed above in the case of the rotation group by using the infinitesimal 
rotation operators. This method can be extended more or less to 
continuous groups in general, but certainly not to groups consisting 
of a finite number of elements because the infinitesimal trans- 
formations cannot be defined. We shall therefore develop a new 
method, that of group characters, which is particularly suited to 
finite groups but which can also be extended to continuous groups 
(Wigner 1931, p. 103). 


Classes 
Let us consider again the point-group 32 of elements H, A, B, 
113 
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K, L, M defined in (4.1) or (4.13). From its multiplication table 
(Table 1), we have 


KAK-1 = B, KBK-! =A, 
LAL“! =B, LBL“ =A, 
MAM-! = B, MBM" =A, 
BAB =A, BBB =B, 
AAA" =A, ABA-1 =B, 
EAE" =A, EBE-! =B, 


and we note that SAS-! and SBS-! are equal to either A or B 
for every group element S. A and B are said to form a class. 
Definition: a class is a set of elements out of a group, such that if T 
ts one element of the class, then STS} also belongs to that class for 
all group elements S. It may easily be verified by direct multiplica- 
tion as above that the three classes of the group 32 are (£), (A, B), 
(K, IZ, M). In general, to sort the elements of a finite group into 
classes, we would start with one element 7', and form all products 
ST,S—1. This gives a set of elements 7,, T,, ... 7s. We then 
form ST,S-}, .. . ,S7,S—1 and in general this gives the old elements 
T,, T,, ... 1; with possibly some new ones 7;,,,.... If there 
are no new ones, then 7), 7, ... 7; form a class; if there are 
additional elements 7;,,,..., then we repeat the process until we 
have found a closed class. It is clear that in the process of forming 
classes, we cannot have an element left over at the end which 
does not fit into any class: for let 7 be such an element. Then either 
all the products ST'S-} are equal to 7, in which case 7 forms a class 
by itself, or one of the products, S,7'S,~1 say, is equal to some other 
element FR say. In the latter case we have 


(S\) RS)? = TP, 


and since S,~! is some group element this equation shows that T 
would already have been included in the class of R in the procedure 
described above. Thus all the group elements can be sorted into 
classes with each element belonging to one class. 


Characters 


Let Dy(T) be any representation of a group of transformations 7’. 
The sum of the diagonal elements of Dj(7') is called the character 
x of 7' in this representation. 


x(T) = 2 DT). (14.1) 
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If S is any matrix, we have using the summation convention, 


[SD(T)S-*}ua = Sy Dp(L)Sei-} = See" Sy Dj (T) 
= 84¢jDjx(T) = x(7). 


From this we have two results. The characters of all elements in 
one class are the same. Also from the definition of equivalence 
(5.15), the characters of equivalent representations are the same. We 
shall later prove the converse of this, namely that two representations 
having the same set of characters are equivalent. Here a set of 
characters means the characters for all the classes in a group. Thus 
such a set of characters is a very useful way of specifying a re- 
presentation. It does not distinguish between equivalent representa- 
tions, which is usually a convenience rather than a hindrance, as 
discussed in § 5. However it does distinguish a given representation 
from all inequivalent ones. Moreover the number of classes in a 
group is considerably smaller than the number of elements, and 
hence so is the number of different characters. In particular the 
irreducible representations of a group can be tabulated easily in 
this way. For instance from Table 3, we have for the irreducible 
representations of the group 32 the characters shown in Table 8. 


TABLE 8 
Character Table for the Point-group 32 


E A K 
xP 2 | 0 
xt 1 1 —l 
xF 1 1 1 


This is called the character table of the group, and the character 
of only one element in each class is usually listed. Clearly the 
identity element EH always forms a class of its own and (Z) is the 
dimension of the representation. 


Reduction of a representation 

Suppose a representation D is reducible into irreducible com- 
ponents D@ + Di +... Di), Then we can bring D into reduced 
form by a transformation P-1D(T)P, and hence the characters are 


x(P) = UP) + (2) +... + xD) 
(14.2) 


— > cy X(T). 
a 
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Here c, is the number of times the irreducible component D 
occurs in the reduction of D. Hence after setting up (7) for each 
of the classes, a representation can be reduced by determining the 
c, by inspection from (14.2), or if necessary by solving the (14.2) 
as simultaneous equations.t For instance consider the representa- 
tion of the group 32 in the vector space (x?, y?, 27, yz, zz, xy). These 
vectors are neither normalized nor orthogonal but they still form 
the basis for a representation in this space, with D(A) being given 
above equation (5.20). By inspection the characters are 


x(#) =6, x(A)=0, x(K) =2. 


Hence from the character table of the group (Table 8) and solving 
the simultaneous equations (14.2) by inspection, we have 
Cr = 2, Cy = 0, Cy = 2, 
and D=Ir+Tr+4+4+Y, 
as was found before in (5.21). 

This method can be applied easily to product representations 
(§ 9). Let uw, v; be base vectors transforming according to the 
representations D™ and D™ of a group, and 7 some transforma- 
tion of the group. Suppose 


Tuy = Ty ug + other terms in ux, 

Tv; = Ty; vy + other terms in 1. 
Then ZT (ugvj) = Ta Ty (uyvj) + terms in (uzv). Hence if we 
take (wv;) as the nt“ base vector in the product space, the diagonal 


element Dan(Z) of the matrix D(T) of the product representation 
D= DY x D™ is just Ty Ty. Therefore 


x(L) = 2 Dan(T) = 2 Pe Ty = X(T) x(T). | 14.3) 


Ie. the characters of a product representation are the products of 
the constituent characters. For example from Table 8 we obtain for 
the product representation I x I of the group 32 the characters 
x(Z) = 4, x(A) = 1, x(K) = 0, 
whence from (14.2) 
PK@M=PF+H4S. 


t We shall prove later that these equations have in fact always a unique 
solution (see below equation (14.18)). 
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Another application of (14.2) is to determine according to what 
representation a set of spherical harmonics transforms under a 
group consisting of a finite number of rotations. Let Rot(¢, &;) 
be a physical rotation in the sense of § 2 about the axis &,, and Rot(S) 
any other rotation. Suppose Rot(S) rotates the axis &, into €,. Then 


[Rot(8)}? Rot(¢, E,) Rot(S) = Rot(¢, &). 


For Rot(S) rotates §, into the position §,, Rot(¢, §,) then applies 
a rotation of ¢ about it, and [Rot(S)]-1 moves it back to its original 
position &,. Thus all rotations by one angle ¢ about different axes 
belong to the same class of the full rotation group. By considering 
Rot(¢, z)um = e'"4um, we obtain for the characters of the irreducible 
representation D of the full rotation group 


(14.4) 


The spherical harmonics Yj, transform according to D™ under 
the full rotation group, and therefore they also form the basis of a 
representation D of some finite point-group. Now the matrices 
of D are just the appropriate matrices taken from D, so that the 
characters of D are given by (14.4) with j =1. The representation 
can now be reduced by (14.2). Consider for instance the spherical 
harmonics with | = 2 transforming according to a representation 
D under the group 32. For A and K, ¢ is —120° and 180° res- 
pectively and 


x(#) =5, x(A4)=—-1, xX(K)=1, 
whence from (14.2) and Table 8, 
D=s=I+TPr4+V. 


Thus if we have the character table of some group—and this 
can usually be looked up, for instance in appendix K or L—we can 
find the irreducible components from (14.2) of any given representa- 
tion. We have therefore achieved the main purpose of this section. 
The remainder of the section is devoted to deducing some of the 
other important properties of characters which are required for 
other applications. 


Orthogonality relations 
Consider the matrix 


P = > D°(T-2) G D&T), | (14.5) 
T 
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where D@), D™ are two particular irreducible representations of a 
group of h elements. C is any matrix, and the summation is over 
all group elements 7. We have 


PD\(8) = DS) F D(S-1T-1) CD“ (TS). 
T 


Here S is any group element, and S-17'-! = (7'S)-1 (problem A.5). 
Now the elements 7'8 are just the group elements rearranged, for 
there are h of them and we cannot have 7S = RS unless 7 = R. 
Hence the summation over JT can be replaced by a summation over 
TS and we have 

PDS) = DO(S)P. 


P therefore satisfies the conditions for Schur’s lemma (appendix 
D), and we have P =0 if D® and D™ are inequivalent, and 
P =a,E if they are identical, where a, depends on the particular 
matrix C. Since C is arbitrary, let us choose all elements zero 
except one, C,, = 1. This gives 


2 Dog P-) Dy (T) = ng 79 Aus (14.6) 


where we have used 8,,, to denote 1 when D® and D™ are identical, 
and zero when they are inequivalent. When they are identical, we 
evaluate a,, by putting 8 =j and summing over the repeated 
suffix. Thus 


> Depa! (L-")Dyg(P) = nytag = hda5 
T 
since D®(T-1)D(T) = EZ. Here n, is the dimension of D. 
Hence a,, = hé,,/n,. Also . 
DT) = (DT)? = (DD) (14.7) 


if we suppose the representation is unitary (problem C.2). Thus 
(14.6) becomes 


h 
2 Dig"(T) Dy(T) = an 8.45 5;5ap- (14.8) 


This is the fundamental orthogonality relation of the first kind for 
irreducible representations. By putting 1 = j, « = 8 and summing, 
we obtain the orthogonality relation for the characters 


2x" T) xT) = hd,,. (14.9) 
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Projection operators 


We now discuss the reduction of a given vector space and the 
corresponding representation and vector space in further detail. 
First we can use (14.9) to obtain the explicit solution of equations 
(14.2) determining the irreducible components of any representa- 
tion. Multiplying (14.2) by x%*(7') and summing, we obtain 


] 
= 5 > XO) X(7). (14.10) 
T 


The uniqueness of the reduction shows that two representations 
with the same characters are reducible into the same irreducible 
components and hence are equivalent. 

Let & be a vector space transforming according to a representa- 
tion, which can be reduced into irreducible components by (14.10). 
It remains to pick out the actual vectors in R transforming according 
to the different components. Let D,,“(7') be one of the irreducible 
component representations, and ¢ any vector of R. Consider the 


vectors 
$i = 2 Dy*(T) Th (14.11a) 


for any fixed value of 7. Then from (14.11a) and (14.7), we obtain 
Sh,% = J Dy*(T)ST¢Y 
T 
= D,,*(8-1) 5 Dy *(ST)ST¢ 
T 


= Dy(S)¢., 


which shows that the ¢,” are vectors in R transforming according 
to the particular irreducible representation D,(7). Putting i =j 
in (14.1la) and summing, we have that 


g% = 2 x*(T) T¢ (14.11b) 


belongs to an irreducible subspace transforming according to D™), 
This relation is useful if we only have the characters and not the 
particular matrices D,,”)(T), or if we do not require a proper set of 
base vectors in the subspace. In vector analysis, i- v is the projection 
of the vector v on i and is the part of v that lies in the direction i. 


9 
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By analogy the operators in (14.11) are called projection operators, 
because they pick out from ¢ the part lying in a particular subspace. 


Matrix elements 


We can now give an alternative derivation of the fundamental 
theorem of §13 on the calculation of matrix elements. Let 4,” 
be a set of functions transforming according to the irreducible 
representation D®. Since an integral is invariant under trans- 
formations (lemma 2, appendix C), we have 


h f $% dr = 5 | Tb, dr 
= 3 Dy (T) | $i dr 
Tv 


= [>1 -Dy(T)] | $i dr. (14.12) 
T 
(¢ summed) 


Here the 1 in the square bracket denotes the element D,,(7') of 
the identity representation. Thus from the orthogonality relation 
(14.8), (14.12) is zero unless D” is the identity representation, 
ie. we have [¢," dr =0. If D” is the identity representation, 
then the 7 and j suffices become superfluous and (14.12) becomes 


hfgm dr = h fom dr 


and the integral in general has some arbitrary non-zero value. 
If we now take ¢ to be some function in a vector space & trans- 
forming according to a reducible representation D, then we can 
express ¢ in terms of the irreducible base vectors of R, and hence 
Jédzr =0 if D does not contain the identity representation. If D 
contains the identity representation n times, [4 dv is a linear com- 
bination of n undetermined integrals. We have therefore proved 
the theorem of §13 on the evaluation of matrix elements in the 
particular form of result (13.8c). 
Let us consider the reduction of 


D = D®* x D™, 


where D® and D™) are irreducible representations. The characters 
of D are given by (14.3). Suppose D contains the identity re- 
presentation c times. The characters of the identity representation 
are all unity, so that from (14.10) ¢ is given by 


1 
t= i » } M*(T) (4 (T) 
T 


= 6), from (14.9). (14.13) 
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In words, D”* x D™ contains the identity representation once, if 
and only if DY = D), This is just the result of problem 13.13. With 
this and (13.8c) already proved, (13.8b) and part III of the theorem 
of § 13 follow immediately, and parts I and II then as special cases, 

The regular representation D*°¢ of a group is defined as follows. 
First write out the group multiplication table, rearranging the rows ° 
so that all #’s occur on the diagonal, as shown in Table 9 for the 


TABLE 9 
Construction of Regular Representation 


£-\= F 
A-1= B 
Bi=zA 
K3= K 
Lot= L 
M-i= M 
= 7,7 
applied 
first 


point-group 32 (4.1). The columns are labelled by the elements 
T; and the rows by 7;-!. Then the matrix D'¢®(A) is obtained 
by putting all A’s in the table equal to one and all other elements 
zero. Thus Dj7®(A) = 1 when 7;7;"* = A, ie. when 


AT; = T, = Die®(A)T. 


Thus the regular representation is a representation among the group 
elements considered as base vectors. Now by construction, 
xree(E) =h and x8(7) =0 for TAH. We may reduce Dre, 
and applying (14.10) we find c, = m,. Le. every irreducible representa- 
tion D™ 18 contained in the regular representation n, times, where n, is 
the dimension of D”. Counting up the total number of base vectors, 
we obtain as a by-product the relation 
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In the reduction, let Y,;°, 7 = 1, 2, . . . denote the 7*® set of base 
vectors transforming according to D”). 

We shall now construct all transformations P that commute 
with every element of the group, and that are linear combinations 
of the group. By the latter condition P transforms every vector 
space Y,°"),1=1, 2, .. . into itself. Hence P is represented with 
respect to the Y,“") by a matrix in reduced form. Since we also 
require it to commute with all group elements, it must, by Schur’s 
lemma (appendix D), have the particular form diag {a,,H, a,,H,..., 
Q,H, Gy,H, . . .} where the E’s are little unit matrices along the 
diagonal. Further it must retain this form if we choose new base 
vectors Y,°7) + Y,“*), whence ay = d,, = a, say. Thus we can 
write P in the form 


where the E, are operators such that Z,Y,°" = Y,0”, H,Y,4" =0, 
and where the a, are arbitrary constants. H, is the unit operator 
inside all the subspaces transforming according to D”). Now we can 
also construct P in a different way. Since P commutes with any 
element S of the group, it must contain the elements 7’ and STS“! 
an equal number of times. Hence P can be expressed in the form 


P= > by Mx, (14.15b) 
Kk 


where M; is the sum of all elements in the k'® class. Thus P can be 
considered as a vector in a space spanned by the £, or the M,. 
Either way the dimension of this space has to be the same, whence 
comparing (14.15a) and (14.15b) we have 


number of irreducible representations = number of classes. 


(14.16) 
Let the number of elements in the k*® class be hy. Then (14.9) 


can be written 
hp\ 2 hy\ 12 
ee ea 
k 


If we define the matrix U such that 
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then (14.16) shows that U is square. Also (14.17) becomes U*U = E. 
Hence U is unitary and we also have U*U = E, which when written 


out becomes 
1/2 hy\}/2 
> ()"we(B) “amie a 
A 


These are the orthogonality relations of the second kind. The unitary 
property of U also ensures that the simultaneous equations (14.2) 
always have a unique solution [namely (14.10)]. Thus we have an 
alternative proof of Theorem 3, appendix C, namely that the 
reduction of a representation is unique, apart from equivalence. 

As an application of the above formulae, we shall! set up from first 
principles the character table for the group 32. As already deduced, 
there are three classes, (Z), (A, B), (K, L, M). From (14.16) there 
are three irreducible representations, and (14.13) becomes n,? 
+ 29? + ng* = 6. The only solution is n, =n, =1, nz =2. We 
always have the identity representation with characters 1, 1, 1. 
Since K* — H, A® = E, the only possibilities for the characters of 
the second representation are 1; 1, w or w?; +1; where w? = 1. 
Relation (14.17) with A=1, » = 2 gives 1, 1, —1 as the only 
possibility. The third representation has y(#) = 2. Then (14.18) 
with j =1, k =2 and j =1, k =3 determines the other two 
characters, namely —1 and 0. This gives just the character table as 
shown in Table 8. For more complicated groups the result of prob- 
lem 14.9 can be very helpful. 


References 


More detailed accounts are given of the properties of characters 
in Speiser (1937, pp. 166-175), of the method of characters applied 
to continuous groups in Wigner (1931), and of characters in general 
in Littlewood (1940). A systematic method of setting up the 
character table is given by Bhagavantam and Venkatarayudu 
(1951, p. 254). 


Summary 


Group characters have been defined, and used to label representa- 
tions, particularly the irreducible representations. Any representa- 
tion may be reduced using (14.2) and (14.10). Functions with given 
transformation properties may be determined using the projection 
operators (14.11). The two types of orthogonality relations and 
other results may be used to derive the characters of all the irreducible 
representations of a group. 
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PROBLEMS 


14.1 The spherical harmonics of order 3 transform according to 
D under the point-group 32. Reduce D. 

14.2 A rare-earth ion in the free state is in a state with J = 1 
with energy H,. In a crystal it is in an electric potential with 
symmetry 32 and mean zero. Show that the level is split into two 
levels H,, H,, and determine the ratio (HZ, — E))/(H, — E,) (ef. 
§ 6, also problem 13.12). 

14.3 Prove equation (9.2) using group characters. 

14.4 w,, u_ transform according to D@/® of the full rotation 
group and according to D under the point-group 32. Try to reduce 
D. What has gone wrong ? 

14.5 Use (14.11) to find base vectors in the space (x?, y*, 23, 
yz, zx, xy) transforming under the group 32 according to the irreduc- 
ible component representations (cf. equation (5.20)). 

14.6 Show that in an Abelian group each element forms a class 
by itself. Hence deduce that all the irreducible representations are 
one-dimensional. 

14.7. From (14.10) show that > |x(7)|? = > c,?. Hence show 

T A 


that a necessary and sufficient condition for a representation to be 
irreducible is that > |x(7)|? = h. 

14.8 Show (14.14) is a particular case of (14.18). Also deduce that 
>mx(T) =h if T = EH, and is zero otherwise. 


a 

14.9 Let C; be the i class of a group. The product C;C; denotes 
all elements obtained by multiplying those of C; by those of C;. 
Show that these can be grouped into classes. Let the class Cy, 
appear cy, times. This is written C7Cj = > ciyzCy. With this nota- 
tion, show that . 


hahyys xy = OE) >, ela”. 


Hint: show My = 7, (14.15b) as regards its operating on irreducible 
vector spaces; inj = > Ciyxnx; the character of My is nex (BZ) = 
hixi™ (Speiser 1937). 

14.10 Set up the character table for the point-group 422. This 
is the group of proper rotations that leaves a square invariant. 
Use any of the relations of this chapter and of problems 14.7, 14.8, 
14.9, and trial and error to set up the character table for this group. 
Verify the result of problem 14.9 in a few cases. Also reduce all the 
product representations DY x D™, 

14.11 ¢; and ¥,;‘” transform irreducibly according to D™ and 
D), Show that My; = fi*yy dr = [(T':)*(Tyy) dr. Hence using the 
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orthogonality relations show that M;; =0 when D” and DW 
are inequivalent. Deduce also the result IB of the fundamental 
theorem of § 13. 

14.12 In the fundamental theorem of § 13, in (13.8) and problem 
13.13, in (14.12), (14.13) and problem 14.11, we have a series of 
closely related results with at least two independent possible lines 
of proof. Summarize how these are all logically inter-related. 

14.13 Suppose you are given the characters of an irreducible 
representation. How would you construct in a systematic way an 
actual set of matrices to constitute the representation? 

14.14* Discuss the application of group character techniques 
to continuous groups, in particular to the rotation group (Wigner 
1931, p. 97). 

14.15 A set of base vectors ¢; transform according to the 
irreducible representation Dy 7’) of some group of transformations 
T. Show from the orthogonality relation (14.8) that 


d= 2 Dy*(T) Td, 


where the suffix j is not summed over. Thus having once obtained 
one vector ¢; of a representation, we can define all the other ¢, 
in terms of it. 

14.16 DT) is an irreducible representation of a finite group, 
and D*(T) the representation which is obtained by taking the 
complex conjugate of every matrix of D®(T). If D™ and DM* 
are not equivalent, show from the orthogonality relation (14.8) that 


> xT?) = 0. 
T 
If D™ and D* are identical, i.e. real, show similarly that 
S(T) =h 
T 


where h is the number of group elements. For an application of 
these results, see equation (19.22). Note: D(T?) = D(T)D(T) in 
any representation. 

14.17 Prove just from the orthogonality relation of the first 
kind (14.9), that the number of irreducible representations of a group 
must be less than or equal to the number of classes. 


15. Product Groups 


Irreducible representations of product groups 


Let us first consider the full rotation and reflection group, which 
consists of all proper and improper rotations about a point. In 
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particular it contains the inversion JZ (3.11), and any improper 
rotation can be written RJT where R is a proper one. Let R be a 
vector space invariant under this group. In ® we can use the proper 
rotations to pick out a set of base vectors %m transforming according 
to D' under the rotation group, where for the present we shall 
consider J to be an integer. Then we can form the functions 


tins = ¥m + Ibn, tm = ¥m — Ibm, (15.1) 


Ibm, =%m,, bm = —bm_. (15.2) 


Now /7 commutes with every proper rotation. Hence operating 
on (15.1) with some rotation R we obtain 


Rim, = Rim + Rlbm = Rbm + TIRbm 
= D®(R)(by + Tn) = D®(R)bn4- —*(15.8) 


The Ym, transform therefore according to D“ under proper rota- 
tions. Thus from (15.2) and (15.3), the %m, form a vector space 
which is invariant under both proper and improper rotations, and 
which forms the basis of an irreducible representation D@*+ of 
the rotation and reflection group. Similarly the %m_ give an irreduc- 
ible representation which we shall denote by D‘-), When the 
vm transform according to DY under proper rotations, where j is 
half an odd integer, the representation is double-valued. Since 
IT? = E and E is represented by plus or minus the unit matrix, we 
can similarly construct functions mi, me such that 


TNbnj = +4 IN m;4 = Lhtpint- (15.4) 


These give representations which we shall denote by D+ and 
DEH+DYO, 

We shall not prove here that the above representations are the 
only irreducible ones, but proceed to the general theory. Consider 
two groups a, and g,, which are such that every element P, of 9, 
commutes with every element 7’, of do, ie. 


PT, = TP. (15.5) 


Then we can form a group © consisting of all products P,T, or 
T,P, of elements from 9, and gg. Clearly these products satisfy 
the group postulates, e.g. (P,7')(Q,S2) = (P,@,)(T,8.) is always 
another such product. For instance if q, is the full rotation group 


so that 
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and g, is (L, IT), we have the rotation and reflection group. Other 
examples are the following pairs: 


if 


91 . Se 
rotation of all co-ordinates permutation of all co-ordinates 
Tt, Oz Ti, Ozt 
rotation of all orbital rotation of all spin 
co-ordinates r; co-ordinates o24 
rotation of co-ordinates rotation of rz, oz, 


Ty, Tz 


The group © is called the direct product of a, and gp», written 


We shall now construct all the irreducible representations of 
© =a, X gg. In any invariant vector space we can construct a 
set yy, «= 1, 2, . . . transforming according to the irreducible 
representation DO” of g,. Let 7, S be any elements of g,. Then 
from ¥, we can generate a set of functions Ty,, Sy, etc., which 
clearly transform into one another under g,, and we can reduce 
them and form a set of linear combinations 


Pra 7 2 braT yy 
T 


transforming according to some irreducible representation D‘%) 
of a4. Using the other 4, let us now define the functions 


Yin = (Doral he. (15.7) 


Since any element of g, commutes with every 7’, it commutes with 
the operator 5b7,T in (15.7), and the 4,7 =1,2,.. . transform 
under g, in exactly the same way as the yy, i.e. according to D\), 
What is now the effect of operating on y¥;, with an element S of 
3,? Since S commutes with any element of g,, Sy;,, 4 = 1, 2,... 
transform according to D4). Hence from part I of the fundamental 
theorem of § 13, we have 


Stine — 2. %(2 ep 


where a@,(«) does not depend on 7. Thus the y,,, a =1, 2,... (4 
fixed) transform into one another under g, in the same way as the 
1,, 1.e. according to D‘'*). We have therefore obtained a rectangle 
of functions ¥,, such that the functions in each column (a fixed) 
transform under g, according to the irreducible representation D®), 
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and the functions in each row (¢ fixed) transform irreducibly accord- 
ing to D') under g,. Together they give the representation DO*2) 
of the product group 9, X g», and this representation is clearly 
irreducible. In this way we can construct systematically all irreducible 
representations of a, X G2 by combining each D™) with each D\), 

Applying this to the earlier example, the irreducible representa- 
tions of £, IT are 

x(£) = 1,1, +1, +1; 

When j is half an odd integer, DV'+) and D%—) are identical with 
DG*) pecause D!) is already double-valued and JJ = JE is 
represented by -+ the unit matrix. This gives just the two irreducible 
representations DY+, Di+0 already obtained before. When 1 
is an integer, we have the four representations D+), Dé-), 
DE+V, D+, but the last two have such unphysical properties, 
(single-valuedness under proper rotations, double-valuedness under 
IT) that they have found no application (as yet!). 


Summary 


We have defined what is meant by a product group and shown 
how to construct all its irreducible representations out of those of 
the component groups. 


PROBLEMS 


15.1 In the notation of (15.5), verify that all the products 
P,T, satisfy the group requirements of § 4. 

15.2 The functions u,;, ui, 1 = 1, 2, 3, are spin functions for 
three electrons. Sort out linear combinations of the products 
Us1 Ut. Utz into rectangles that transform irreducibly under the 
product group (rotation of all oz) x (permutation of oj). Hint: 
the correct linear combinations can be obtained using problems 
11.5 and 5.17. The answer is given in (28.16). 

15.3 What are the complete symmetry groups of Hosct, H orp: 
H ory +A gpin defined by (10.9), (10.2), (11.8), (11.9)? Interpret 
some of the degeneracies of §§ 10, 11 and 12 in terms of representa- 
tions of product groups, particularly (10.8), (11.17). In terms of the 
complete symmetry group of # oy +H spin, What representations 
are allowed by the exclusion principle? 


16. Point-groups 


A point-group is a group of rotations about a point, i.e. of rotations 
whose axes intersect at a point. This distinguishes them from the 
more general space-groups which involve translations and rotations 
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about non-intersecting axes. In this section we first describe and 
tabulate the thirty-two point-groups of crystallographic interest. 
Then follows a more detailed discussion of selected aspects of point- 
groups, namely: how all the crystallographic point-groups are 
derived; other point-groups such as apply to molecules, including 
those derived from the axial rotation group; how the character 
tables are derived; and derivation of the double-valued spin re- 
presentations. The complete character tables of the most important 
point-groups are tabulated in appendices K and L. 
Crystal point-groups 

In a crystal the atoms are arranged in a regular periodic array, 
and the fact that it is periodic severely restricts the kind of rotational 
symmetries that the arrangement can have. The rotations by all 
multiples of 360°/n are referred to as an n-fold rotation axis, and it 
will be shown below that crystals can only have 1-fold, 2-fold, 
3-fold, 4-fold and 6-fold axes. For instance it is impossible to have 
a periodic array with five-fold symmetry. These proper rotations 
are given the symbols 1, 2, 3, 4, 6, and in addition there can be 
improper ones. A roto-inversion axis % consists of all multiples of a 
rotation by 360°/n followed by the inversion through the origin [7 
(3.11). In particular 1 is just H and the inversion itself, and the 
group 2 is # plus a reflection m in a mirror plane perpendicular to the 
axis. In fact the axis 2 is usually denoted by m. There are also 
roto-reflection axes %, whose basic element is a rotation by 360°/n 
followed by a reflection in a mirror plane perpendicular to the axis. 
In these improper axes n again can only have the values 1, 2, 3, 4, 
6. The two types of improper axis are not independent. In fact the 
roto-inversion axes are used almost exclusively now although in the 
older literature, particularly in the Schoenflies notation, the roto- 
reflection axes were adopted as the basic improper axes. We have 
the relations between them I = 2 = m, 2 = 1,3 = 6,4 = 4,6 =3. 

These and other facts about point-groups can be seen mostly 
easily by drawing stereograms. Consider a sphere with a spot 
marked on it at some arbitrary point. If we apply all the rotations 
of a point-group to the sphere, we can represent each element of the 
group by the position taken up by the spot. The position can be — 
plotted by taking a bird’s-eye view of the sphere from above the 
north pole, a dot meaning a position in the northern hemisphere 
and a circle one in the southern hemisphere. Such a diagram is 
called a stereogram.t Thus if we plot a dot or circle for each element 


{ This description of a stereogram is technically not quite correct (Phillips 
1946), but it is near enough for our present discussion. 
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of a point-group, we obtain a pattern of dots and circles representing 
a set of equivalent directions in space which are rotated into one 
another under the action of the group. It can now be seen, for 
example, that the stereogram for the point-group 3 (Fig. 9) is 


Type 


Triclinic Monoclinic Tetragonal Trigonat Hexagonal Cubic 
and 
Orthorhombi 


n/m |4/m=m “Oe 


* 


te 


fim 


n2 


eo 


222 422 32 
eo foRo} 
a/mm|\ \inme2 “() 3/mm =6m2 
eo eo 


mmm = 4/mmm 


Fia. 9. Stereograms of crystal point-groups. 


invariant under a 60° rotation followed by a horizontal reflection, 
so that 3 is equivalent to 6. 

We can form more complicated point-groups by combining two 
or more axes. For instance the stereogram for 422 (Fig. 9) shows 
that this point-group contains a four-fold axis (perpendicular to the 
page) and four two-fold axes in the plane of the paper. Fig. 9 shows 
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all the thirty-two point-groups that can apply to crystals. They 
are labelled in the abbreviated international notation. First a 
complete symmetry symbol is constructed as follows. The main 


axis is given first. Then 2 usually written n/m, denotes a mirror 
m 


plane perpendicular to this axis. Then follow other non-equivalent 
symmetry axes and planes. For instance 4mm has two pairs of 
vertical mirror planes which are at 45° to one another and hence 
are not equivalent to one another under the action of the group. 
The complete symmetry symbols are then abbreviated, just enough 
being retained to characterize the group completely. Appendix J 
shows the complete symmetry symbols, the abbreviated (inter- 
national) symbols, and also the older Schoenflies symbols for all 
the point-groups. 

In the point-groups of the cubic system, the 3-fold axes are 
along the body diagonals of a cube, the 4-fold or main 2-fold axes 
perpendicular to the cube faces, and other 2-fold axes parallel to 
the face diagonals. For a fuller description of the point-groups, 
we refer to any book on crystallography (e.g. Phillips 1946, Buerger 
1956). 

The character tables of all the crystal point-groups are given in 
appendix K, where the notation for the irreducible representations 
is also explained. The derivation of the character tables is discussed 
later in this section. 


Derivation of the crystal point-groups 


We shall now show how the point-groups are derived and why 
there are only a finite number. Very complete derivations of this 
and also of the space-groups have been given by Seitz (1934, etc.), 
Zachariasen (1945) and Murnaghan (1938). To bring out the main 
ideas, we shall consider in detail the point-groups which consist of 
proper rotations only. There are eleven of these (Table 10). We 
shall derive them from first principles and show no others can exist. 
The improper point-groups may easily be derived from these as 
shown in the above references and in problem 16.4. 

We first prove that only n-fold axes can exist, where n = l, 2, 
3, 4 or 6. Because of the periodicity of the crystal structure, any 
translation 


t = Nya, + NgAs + NgAg (16.1) 


is a symmetry element and conversely any purely translational 
symmetry is of the form (16.1). Here the a; are the primitive 
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translation vectors, which in general do not have the same length 
and may make skew angles with one another. The 7; are integers. 
Let R(f?) be a purely rotational symmetry. Then R(¢)tR(—¢) 
is another pure translation, say 


t’ et n' Ay + N' py + N' ,As, 


which is just the translation t rotated by an angle ¢ as follows 
directly from (5.4}) or from a diagram. If we write 


n'; = Dy(d)m, (16.2) 


then Dy(¢) forms a representation of the rotation. By choosing 
new base vectors i,, i,, i, along the co-ordinate axes, we see that the 
representation (16.2) is equivalent to the representation D“™ based 
on the functions x, y, z. The character of Djy(¢) is therefore from 
(14.4) 


x(¢) = 1 + 2 cos ¢. (16.3) 


Now if we put n, = 1, n, = n, = 0, then since the n’; have to be 
integers, (16.2) shows that the matrix elements D,j(¢) are integers 
and similarly so are all the Djj(¢). Consequently the character 
is an integer since it is just the sum of the diagonal elements. 
Comparing with (16.3), we have 1 + 2 cos ¢ is an integer, whence 
the allowed values of ¢ are given by 


1+2cos¢= —1, 0, l, 2, 3: 
¢ = 180°, +120°, +90°, +60°, 0°. 
le. ¢ is a multiple of 60° or 90°, which limits the rotational sym- 


metries ton = 1, 2, 3, 4 or 6. The allowed angles can conveniently 
be written 


@ = 2nrk/n, n = 1, 2, 3, 4 or 6, 


16.4 
b= 1, 2,55 12m. sabi 


We now deduce a relation between the numbers 8, of n-fold axes 
in one point-group &. Let S be the sum of all the elements of & 
which we will assume has at least two rotation axes. We can write . 


S=E+>D Snr, (16.5) 
nt 


where Sy, is the sum of all elements except H belonging to the 
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rth y-fold axis. EH belongs simultaneously to all axes. Summing 
(16.3) over all angles (16.4) with k = 1, 2,..., ”-1 gives the character 
of a single Snr, 


x(Snr) =n + 2 > cos 2rk/n — x(H) =n — 3. 
k=1 
Thus from (16.5), 
6 
x(S) = 3 + 2 (n — 3)sn = 3 — 8, + 84 + 33. (16.6) 


We now calculate y(S) in another way. Let R, and R, be rotations 
about two different axes. The collection of elements R,G is just 
the group © itself (cf. below equation (14.5), also problem 4.10). 
Hence R,S = RS = 8S. From (16.2) we may regard a rotation as 
a tensor that turns the vector t into t’ and write t’ = R(¢)-t. In 
this notation, we have 


R,- St =R,- St = St. 


Now R, : St = St implies that the vector St is parallel to the rotation 
axis of R,. Likewise it is parallel to the second axis, and since these 
were assumed to be different, St = 0 for all t, whence S = 0 and 
x(S) = 0. Substituting into (16.6) we obtain 


which is the required result. This relation does not apply to point- 
groups having only one rotation axis. 

We now consider the angles between rotation axes. Suppose 
R(¢,, &:) and R(¢,, &,) are two rotations belonging to a point- 


group. Then 
Rds, Es) = R(bo, Es) R(dy, E) 


has to be another allowed rotation, so that ¢, is one of the angles 
(16.4). Now ¢z is given byt 


cos $¢, = cos $4, cos $4, — sin }¢, sin $4, cos y, _—(16.8) 


where y is the angle between the &, and E, axes. Consider the case 


+ The reader can prove this formula as follows. Let T, and T, be the dyadics 
representing the first and second rotations (see problem 8.4). The dyadic 
T,:T, then represents the resultant rotation, and its scalar (sum of diagonal 
elements) is therefore 1 + 2 cos $5; i.e. T,:T, = 1 + 2 cos ¢5. This equation 
reduces to (16.8) after some manipulation. For a more geometrical proof, 
see Buerger (1956, p. 38). 
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when &, and &, are both 6-fold axes. Putting 4, = ¢, = 60° and 
¢, = 60°, d, = 120° we obtain respectively 


cos $¢3; = 7 — 3 cosy, 


and cos $$’,3 = 31/3 — 41/3 cos y, (16.9a) 
whence eliminating cos y we have 
cos #45 — /$ cos Ad's = F. (16.9b) 


From (16.4) the allowed values of cos 44, and cos 4¢’, are +4, 
+4+/3, ++/}, 0, +1, and we see by inspection that the only 
allowed solutions of (16.9b) are 

(i) cos $4, = 4, cos $¢', = 0; 

(ii) cos $¢, = I, cos $4’, = 4/3; 

(iii) cos $6, = 0, cos $4’, = —4$+/3. 
From (16.92), these solutions give respectively 
(i) cosy = 1, y=0°; (ii) cosy = —1, y = 180°; (iii) cosy = 3, 


Case (iii) is meaningless, and in either of the other cases the axes 
E1, &, coincide. Thus there can be at most one 6-fold axis in a point- 


group. 
TABLE 10 
Number of Axes in Proper Point-groups 


8 33 & Se point-group 
(i) 0 0 0 0 1 
(ii) 1 0 0 0 2 
(iii) 0 1 0 0 3 
(iv) 0 0 1 0 4 
(v) 0 0 0 1 6 
(vi) 6 0 0 1 622 
(vii) 4 0 1 0 422 
(viii) 6 4 3 0 432 
(ix) 3 1 0 0 32 
(x) 3 4 0 0 23 
(xi) 3 0 0 0 222 


We can now tabulate ali the possible combinations of axes. If 
there is only one axis we have the point-groups (i) to (v) (Table 10). 
We next consider the case when there is a 6-fold axis; there can 
only be one as we have just proved. From this it follows that there 


REPRESENTATIONS OF FINITE GROUPS 135 


can be no separate 3-fold axis since this would require at least three 
6-fold axes to give a 3-fold symmetry. Likewise there can be no 
four-fold axes. Then (16.7) gives only case (vi) (Table 10) with 
8 > 0, and in enumerating the other possibilities we shall assume 
without further mention that s, = 0. Consider the case of a single 
4-fold axis. There can be no 3-fold axes because these would lead 
to more than one 4-fold axis, but from (16.7) there are four 2-fold 
axes, giving case (vii). If there are two or more 4-fold axes, it 
can easily be shown from (16.8) that they intersect at 90°. Now 
from (16.8) or problem 8.4, R(90°, z) R(90°, y) is a 120° rotation. 
Hence we have a 3-fold axis and a third 4-fold axis in the z-direction. 
There can be no more 4-fold axes because there are no further 
directions at right angles to each of the 2-, y- and z-axes. Further 
there must be four 3-fold axes to preserve the four-fold symmetry, 
which gives case (viii). This exhausts the cases with 4-fold axes. 
If there is one 3-fold axis, any 2-fold axes must be perpendicular 
to it and from (16.7) there are three of them, giving case (ix). If 
there are more than one 3-fold axes, (16.8) gives cos y = +4. 
Further ¢, = ¢, = 120°, cos y = } gives ¢, = 180°, i.e. a 2-fold 
axis. Then from (16.7) there must be three 2-fold axes, which in 
turn generate four 3-fold axes. The latter are oriented with respect 
to one another as the body diagonals of a cube, and the cos y = +4 
leaves no room for more. Hence we have only case (x). From (16.7) 
there remains only the possibility of three 2-fold axes, case (xi). 
Having derived all the possible numbers of axes, it can now be 
shown that for each case of Table 10 there is essentially only one 
way of arranging the axes in space. This follows from elementary 
geometrical reasoning. For instance in case (x), the restriction 
that the three-fold axes intersect at cos—4(14) allows only one 
arrangement of them. The two-fold axes must lie either perpendicu- 
lar to each three-fold axis or bisecting the angle between two of 
them. For there to be only three two-fold axes it can easily be seen 
that the latter alternative must hold, which fixes their directions 
uniquely. This completes the derivation of all proper point-groups, 
and the ones with improper axes may be derived from them as 
already mentioned (Zachariasen 1945, p. 40; problem 16.4). 


Character tables and other point-groups 


Besides the crystallographic point-groups, there are other finite 
subgroups of the full rotation group. Using the same notation as 
before, an n-fold rotation axis 1 is a cyclic point-group for any 
value of x. We can add 2-fold axes perpendicular to it to obtain 
the dihedral group n2. Similarly by adding mirror planes (improper 


10 
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2-fold axes) we obtain the groups 7, n2, n/m, nm, n/mm, v, % 2m, 
ete. These are not all different, but which ones are the same 
depends on whether we have n = 2q + 1, 4g, or 4g + 2 where ¢ 
is an integer (cf. Fig. 9). Apart from these there is only one other 
proper point-group, and that corresponds to the symmetry of a 
regular icosahedron (Murnaghan 1938, p. 336). 

In addition to the above finite point-groups, there are several 
subgroups of the full rotation group that are associated with the 
axial rotation group. The latter has the symbol oo in the present 
notation, and the other derived groups are o0/m, 002, oom and 
co/mm. These are very important for describing diatomic molecules 
and we shall now derive their character tables. Firstly oo is a 
cyclic group and its irreducible representations have been derived 
in § 7. Each rotation forms a class of its own (cf. problem 14.6), 
and the character table is as shown in Table 11 and appendix L. 


TaBLE I] 
Character Tables of 00 and com 


Irred. The group 0 
rep. xB) xIR(¢, 2)] 
Ay 1 1 
Ax 1 exp(tk¢) 
A_x 1 exp( —tke) 
(k 2 1) 
Teed: The group oom 
rep. x(#) x{R(¢, z)] x(m,) 
Ay 1 l 1 
A_ 1 1 —1 
Ey 2 2 cos kp 0 
(k 2 1) 


Note: The notation for the irreducible representations is not quite the standard 
notation of appendices K and L, but is more suited to the present discussion. 


Since there are an infinite number of classes we expect an infinite 
number of irreducible representations. Consider next the group 
com, operating on some invariant vector space. This space can 
first be reduced according to the axial rotation group, and let 
¥, be a vector in it which transforms according to the representation 
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exp(tkd) of Table 11. In spherical polar co-ordinates, the reflection 
mz perpendicular to the z-axis induces the transformation 


@=-8 =—¢. 


If we put 
2 = Mz), 
then we have 
R(¢, zt = R(¢, z)mzp, 
= mzh(—4¢, 2), 
= mz exp(— tk), 
= exp(—tkd)yx,. (16.10) 


Hence , belongs to the representation exp(—1kd) of 00, so that 
if k ~ 0 it is linearly independent of %,. However, from (16.10) 
belongs to the same irreducible vector space as ,, and we have a two- 
dimensional irreducible representation Hy. The character of R(¢, z) is 
exp(ikd) + exp(—ikd) = 2 cos kd, 

and the character of mz is 0 because it just interchanges %, and 
%. For k = 0, %, and , need not be linearly independent. We 
can always form the symmetric and antisymmetric linear com- 
binations ¥, + %,, which give two one-dimensional representations. 
In this way we obtain the complete character table shown in Table 
1l. The theory for the other groups derived from oo is similar. 
The group 002 is isomorphic with com and hence has the same 
character table. Also co/m is the direct product oo x (H, mz), and 
coo/mm is the direct product of oom with the inversion group 
1 =(E£, I). Hence their character tables can be written down 
from the prescription of § 15 or appendix K. 

Similar considerations apply to the finite groups n, n2, n/m, 
mm, 2/mm, etc. However there is the following difference. For 
nm even, k in (16.10) may have the value 4n, so that exp(—ik¢) 
= +1 for all possible values 27r/n of ¢. Hence ys, and ¥, in (16.10) 
need not be linearly independent and we obtain two one-dimensional 
representations instead of a two-dimensional one. In this way 
the irreducible representations of all the point-groups except the 
cubic ones (and the icosahedral ones) can easily be obtained. The 
character tables for these remaining ones can be determined from 
the relations of § 14. 


Double-valued (or spin) representations 

Let u,, u_ be the spin functions transforming according to D\/?) 
under the full rotation group. Now this representation is double- 
valued, with the base vectors changing sign under a rotation of 


138 GROUP THEORY IN QUANTUM MECHANICS 


360° about any axis (§ 8), so that u,, u_ must also transform accord- 
ing to a double-valued representation under any point-group. 
The double-valued irreducible representations of all the crystal 
point-groups have been tabulated by Koster (1957). 

We shall now show how the double-valued representations may 
be found, following the method of Opechowski (1940) based on the 
earlier work of Bethe (1929). These representations can be found 
for the cyclic groups n, the dihedral groups n2 and associated 
groups by using half integer values of k in (16.10) and proceeding 
as before. However the following device is more general and con- 
venient. Consider a point-group g of proper rotations, and the 
corresponding group © of matrices taken from the representation 
D/2) (8.24) of the full rotation group. Clearly these matrices satisfy 
all the group requirements, because (8.24) forms a representation. 
However because of the + or — sign in (8.24), ie. the double- 
valuedness of D“/2), ® contains twice as many elements as g does. 
Consequently any single-valued representation of © automatically 
furnishes a double-valued representation of g. The single-valued 
representations of © may of course be found by the methods of 
§ 14 in a straightforward way. The group of matrices © is called 
the double-group of g. 

The procedure becomes clearer by considering as a specific 
example the point-group 32 of transformations (4.13). Let B be 
the 2 x 2 matrix (8.24), taken with a plus sign, which represents 
the rotation R(%7, z), ie. the transformation B’ of (4.13). Let 
B be the same matrix taken with a minus sign. Similarly let Z, 
E, A, A, K, RK, L, L, M, M be the other elements of 6. Now sup- 
pose we have relations KA = L, etc., among the elements of 6. 
These must correspond to the relations K’A’ = L’, etc., among the 
rotations of g. Thus if we have a representation D of the double 
group © so that D(K)D(A) = D(L), etc., then this also gives a 
representation of the original point-group 9, which however will be 
double-valued because we must associate both D(A) and D(A) 
with the same rotation A’. We shall not write out the whole multi- 
plication table of 6, but typical relations are 


Be = A, B = &, Bt= A? = B, A? = Be = EH, 
K?=H=R*% KK=E=K‘. (16.11) 
Notice in particular that if successive rotations like B* or K* add 


up to a rotation of 360° about one axis, they are always equal to 
E. The elements can easily be collected into their six classes 


(Z), (#), (A, B), (A, B), (K, L, M), (K, L, M). 
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The classes are very much as before, e.g. A and B form one class. 
Now 6 contains 12 elements and 6 classes, so that from (14.13), 
(14.6) there are four one-dimensional and two two-dimensional 
irreducible representations. Some of these can be written down 
immediately. Suppose we have an ordinary single-valued irreducible 
representation 4 of g. Then we can obtain an irreducible re- 
presentation of © by associating the matrix 4(R’) of 4 with each of 
the elements R and R& of G. Thus from Table 8 we obtain the 
representations I’,, I, I's of Table 12. Consider now the element 
#. It commutes with every element (matrix) of G, and hence by 


TABLE 12 
Character Table for the Double-group of the Point-group 32 


E B A,B A,B KLM RIM 
ae A es ee 
I, 1 1 1 1 1 1 
rT, 1 1 1 1 —] a | 
I, 2 2 = | =i 0 0 
rT 1 =] 1 =] a zag 
r. 1 —1 l =1 —i é 
cE; 2 —2 oT 1 0 0 


eee 
I,, I, Py give the single-valued representations of the group 32 which are 
identical with %, and I of Tables 3 and 8 and with A,, A, and E£ of 
appendix K. I, I, I’, give extra double-valued representations. 


Schur’s lemma (appendix D) it is represented in any irreducible 
representation by a multiple cH of the unit matrix EH. Since E? 
= H#, we have c= +1. Since R = RE for any R, choosing c = 1 
gives a representation of the type already considered with x(R) 
= x(#) for all R. Thus to obtain further representations we must 
choose ¢ = —1. Consider first the one-dimensional representations. 
Since K* = #, we must have x(K) = —x(K) = +t. The relation 
KA = L then gives y(4) = —y(A) = —1, whence we obtain the 
representations I,, I, of Table 12. I, may then be obtained from 
the relation of problem 14.8. This then gives all the double-valued 
and single-valued irreducible representations of the point-group 
32 by taking x(R), x(f) together. No very systematic or generally 
adopted method of labelling the double-valued ones exists at present, 
and they are usually just numbered consecutively. 

We now formulate some general rules which aid considerably in 
finding the double-valued irreducible representations of any 
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particular point-group or other finite group 9. We first note that the 
device of constructing the double-group 6 of matrices (8. 24) is 
quite general. Let R’ be any rotation of g. Then (8.24) gives the 
two matrices R and R of 6G, and any irreducible representation 
D of & gives a double-valued irreducible representation D(k), D(R) 
of 3. In working out a character table, the first job is to collect 
the elements into classes. Now since the matrices R, R of G form 
a representation of g, they multiply in exactly the same way as the 
rotations R’ of g, except for the possible introduction of minus 
signs. For instance a relation 


P’R'(P’) = S' for g 
which shows that R’ and S’ belong to the same class, becomes 
P(R or R)P-1 = S or S for G. 


Thus the class structure of G is very similar to that of g. To be 
precise, if a set of rotations R’ form a class of g, then the matrices 
R, R form one or two classes of 6. It is not clear from the above 
discussion whether the set R, R makes up one class or breaks up 
into two classes. This can be decided as follows. From (8.24), the 
matrices R and R have characters 


x(R) = —x(R) = 2 cos $6 cos (x + 4). (16.12) 


Since they have different characters, they cannot belong to the 
same class, i.e. we obtain two classes. By suitable choice of angles 
in (16.12), e.g. choosing ¢ = —120° or +240°, we |can always 
arrange that the R matrices form the one class and the & the other, 
which gives the first rule: 


I. If a set of rotations R' form a class of 3, then the matrices R 
form one class and the matrices R another class of ®. 


The above argument does not work when the angle of rotation 
of R’ is 180°, because then the character (16.12) becomes zero. 
In this case R and R may or may not belong to the same class. 
Suppose that they do. Then there exists some element S such that 
R = S-'B8, ice. 

SR = RS. (16.13) 


Let us choose the rotation axis of R’ as the z-axis. Then from 
(8.24) R takes the form 
+ ; 
R= | ar 
a | 
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If we take the general form (8.24) for S and substitute in (16.13), 
we find 6 = 180°. This means that the z-axis (Fig. 6) gets inverted, 
so that S is a rotation by 180° about an axis perpendicular to the 
z-axis. The argument can also be applied in reverse, and we have 
as our second rule: 


II. There is one exception to rule I. If the rotations are through 
180°, then R and R belong to the same class of the double-group if, 
and only if, there is also in the group another rotation by 180° about 
an axis perpendicular to the axis of R. 


Any representation D(R’) of g gives a representation of © by 
associating D(R’) with both R and R, but this does not give a 
double-valued representation of g except in a very trivial sense. 
To obtain the extra irreducible representations of © which give 
genuine double-valued representations of g we must have the 
element H# represented by minus the unit matrix as already proved 
in connection with the group 32. We therefore have from R = RH 
the third rule: 


III. For the extra irreducible representations of the double-group, 
we have - 
x(k) = —x(R), 
and in the exceptional case of rule II, x(k) = 0. 


The above procedure can only be applied as it stands to proper 
point-groups. However any point-group containing improper 
rotations is either a direct product of a proper group and the inversion 
group 1, or it is isomorphic with a proper group which is obtained 
by replacing each improper rotation by the corresponding proper 
one. The former case can be handled using § 15 and equation 
(11.23), and the latter case by considering the isomorphic proper 
group. We can therefore reduce the representation given by any 
set of functions, including spin-dependent functions, with respect 
to any point-group. 


References 


Bethe (1929) and Opechowski (1940) discuss the derivation of the 
double-valued irreducible representations. Koster (1957) has 
tabulated these for the crystal point-groups. A derivation and 
description of the crystal point-groups are given by Seitz (1934), 
Zachariasen (1945) and Murnaghan (1938). Murnaghan (1938) also 
discusses the icosahedral group. 


142 GROUP THEORY IN QUANTUM MECHANICS 


Summary 


We have tabulated all the thirty-two crystal point-groups (Fig. 
9 and appendix J), and the characters of their single-valued irreduc- 
ible representations (appendix K). We have shown how the single- 
and double-valued irreducible representations of any point-group 
may be obtained. We have discussed the groups derived from the 
axial rotation group and derived their character tables (appendix 
L). We have also shown how the crystal point-groups may be derived 
from first principles. 


PROBLEMS 


16.1 Determine the classes of the point-groups 32, 422, 622, 
and hence calculate their character tables by the method used 
for 002 in the text. 

16.2 Show that 422 and 4m2 must have the same character tables. 

16.3. Write down four functions which transform respectively 
according to the four one-dimensional representations of oo/mm. 
Also write down five functions whose maximum symmetry groups 
are respectively 00, co/m, 002, com and oo/mm. 

16.4 An improper point-group contains the inversion element 
IT. Show that the group is a direct product g x I, where g is a 
proper point-group. If an improper point-group does not contain 
the inversion J7 as an element, show that an isomorphic proper 
point-group can be constructed from it by replacing every improper 
axis 7 by the corresponding proper axis ». Hence derive all the 
improper tetragonal point-groups of Fig. 9 from the proper ones of 
Table 10. 

16.5 The functions ¢;,+ = 1, 2, 3 and w,, w_ transform accord- 
ing to D™ and D@/) respectively under the full rotation group. 
According to what representation (in irreducible components) do 
the products ¢,u, transform under the point-group 32? 

16.6 A set of functions transform according to DY) under the 
full rotation group. Let us also use D™) for the representation of 
the group 32 which they form. With this notation, show that 


DY = DU-O499 4997 447 
for J = an integer >6 in the notation of Tables 3 and 8, and that 
DY) = DW-a +. y+ r, + 2F, 


for J = half an odd integer >3 in the notation of Table 12. 
16.7 Construct the characters of the extra double-valued 
representations of the group 422. 
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16.8 Rand S are any two elements of a group in the same class. 
Show that there exists an element T such that 7-'ST = R. 

16.9 Show that in any representation x(R)* = x(R-1). What 
are the consequences of this for the characters of 180° rotations 
in single-valued and double-valued irreducible representations 
(Opechowshi 1940) ? 

16.10 Show how to construct polynomials of degree n which 
transform according to a given representation of a point-group 
(Olson and Rodrigues, 1957). 


17. The Relationship Between Group Theory 
and the Dirac Method 


In this book we have developed the group theoretical method for 
sorting out and labelling a complete set of functions, usually the 
eigenfunctions of a Hamiltonian. This contrasts at first sight with 
the more usual procedure, developed by Dirac (1958), in which one 
uses as a complete set of functions the simultaneous eigenfunctions 
of a set of commuting operators (see for example Schiff 1955, p. 143). 
The purpose of the present section is to relate these two approaches, t 
and show that they are completely equivalent. 

In brief, Dirac proceeds as follows. Let % be an eigenfunction of 
two operators A and B belonging to eigenvalues a, b; 


Ay = ap, Bis = by. 
It follows that 


ABYy = Abp = ba = aBy = Bay = BAY, 
i.e. (AB — BA) =0. 


This suggests that simultaneous eigenfunctions like ~ are most 
likely to exist if AB — BA = 0, ic. if the two operators commute. 
In fact it can be shown (Dirac 1958, p. 49) that the simultaneous 
eigenfunctions of two commuting operators form a complete set of 
functions. By using several commuting operators, we can arrange 
it that the sets of eigenvalues of two different functions are always 
different. This then gives a definite way of achieving a sorted and 
labelled complete set of functions. One of the operators is usually 
chosen to be the Hamiltonian, so that the eigenfunctions are sorted 
out according to their energies. In this case the other operators are 


+ The results of this section are not used elsewhere in the book. It has been 
included for the benefit of those readers whose original introduction to quantum 
mechanics was through Dirac’s book, or who for other reasons like to think 
in terms of complete sets of commuting operators, 
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constants of the motion in the quantum mechanical sense. In 
detail, for any operator A not depending explicitly on time, we have 
(Schiff 1955, p. 134) 


d 1 
qi <A> = 9 4% — HAD, (17.1) 


where <A) is the expectation value f* Ay dr of A. Clearly <A> 
is constant for any state %(t) if A commutes with #, and A is 
called a constant of the motion accordingly. 


Continuous groups 


The relationship between the group theoretical and the Dirac 
methods of labelling eigenfunctions is very simple when the Hamil- 
tonian # is invariant under a group of transformations forming a 
continuous sequence in terms of some co-ordinates g;. Suppose 
H# does not depend on q,. Then if we regard # as a classical Hamil- 
tonian the momentum p, conjugate to q, is a constant of the motion 
because 


Quantum mechanically, », commutes with all p; and all q; except 
q,. Since # does not involve q,, p, also commutes with #. From 
(17.1) it is a constant of the motion just as in the classical analysis, 
and may be taken as one of the set of commuting operators. As a 
simple example we may cite the case of the electron of a hydrogen 
atom in a uniform magnetic field H in the z-direction. The Hamil- 
tionian (neglecting spin) is 
h? e eh la Oe ear a 

on) Toe © Omee oa amok sin? 6. (17.2) 
This is not exactly in canonical form, but it does not depend 
explicitly on ¢. Thus the conjugate momentum, namely the angular 
momentum LZ, about the z-axis, is a constant of the motion as we 
would expect classically. The eigenfunctions therefore have a definite 
value of Zz, namely mA (Schiff 1955, p. 75). This illustrates the 
Dirac viewpoint. 

Now group theoretically we would proceed as follows. Since 
H# is independent of q,, it is invariant under all transformations 
9, =Q, + 4q,. Thus # is also invariant under the infinitesimal 
transformation J,. Moreover all the transformations form a con- 
tinuous group. Then if we have a vector space transforming 
irreducibly under the group, we may use J, to pick out in it the 
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functions invariant under J, and use these as base vectors. Now 
these eigenfunctions of J, are just the same ones as we would have 
found using the constant of the motion p, above, since there exists 
the relation p, = Al, (8.29) between them. There is thus a very 
close relationship between the classical result, the Dirac approach 
and the group theoretical procedure. 


Finite groups 

The above discussion completely breaks down when the symmetry 
group of the Hamiltonian is a finite group 6, for in this case the 
infinitesimal transformations do not exist. Consider for instance 
the Hamiltonian (4.12) of an electron moving in the field of three 
protons arranged in an equilateral triangle. What classical constant 
of the motion is there that corresponds to the fact that the potential 
has trigonal symmetry? We could introduce a co-ordinate gq, = I, 
2, 3, 4, 5, 6 corresponding to the six equivalent positions of the 
triangle. Then # is certainly independent of q,, but there is no 
conjugate momentum because gq, is not a continuous variable and 
differentiations such as 0q,/dt, 24#/dq, become meaningless. In fact 
it is not at all obvious what complete set of commuting operators 
to use in this problem to obtain a convenient set of simultaneous 
eigenfunctions. Let us therefore adopt the group theoretical method 
and assume that we have the eigenfunctions ¥;°") of the Hamil- 
tonian # sorted out to transform according to the irreducible 
representations D® of the symmetry group © of #. We shall 
now construct a set of operators P; of which the 44" are simul- 
taneous eigenfunctions. 

Consider the operator 


Py=(ifee) Dd Tr | (17.3a) 
all T in class k 
=(1/h) > STS", (17.3b) 
al Sin G 


where S is any element of G and 7, any element of the k*® class. 
As in § 14, h is the number of elements of G and h, the number in 
the k® class. From (17.3b), P, commutes with every element of 
©. Hence by Schur’s lemma (appendix D) Px is represented by 
ay times the unit matrix with respect to an irreducible set of base 
vectors yi", + = 1, 2,.... Hence using (17.3a) and taking the 
sum of diagonal matrix elements 


ayn, = (1/hx) 2, xT), 


i.e, ay= xe ry, (17.4) 
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where n, is the dimension of D®, Thus yf; is an eigenfunction of 
Px with the eigenvalue (17.4). Since the characters x,, k = 1, 
2,... characterize an irreducible representation completely, the set 
of eigenvalues a, (17.4) serve to label uniquely which irreducible 
representation a given y belongs to. Furthermore since P; commutes 
with any 7, from (17.3a) it commutes with any P;. Now any element 
T of © is a symmetry transformation of #. Hence operating on 
Hy, it affects only the wave function ys and we have T#ys = AT, 
whence 

TH =HT. (17.5) 


Thus 7' is a constant of the motion, and from (17.3a) so is Pr. 
Our functions ¥,%" transforming irreducibly according to D™ 
are therefore simultaneous eigenfunctions of the set #, P,, Po, . 

of commuting operators. The eigenvalues of the P,; distinguish 
different irreducible representations, and the energies (eigenvalues 
of #°) differentiate between different values of r, e.g. between the 
2p, 3p, 4p, . . . wave functions in an atom since these all trans- 
form according to D“ under rotations. The operators P, cannot 
distinguish between the different y,°", i = 1, 2, .. . of one irreducible 
representation, but some more operators can easily be devised to 
do this too (problem 17.3). 


Summary 


The group theoretical method consists of sorting out the eigen- 
functions of the Hamiltonian # according to the irreducible re- 
presentations of the symmetry group of #. We have shown that 
this is completely equivalent to the Dirac method, in which we set 
up a complete set of simultaneous eigenfunctions of # and other 
commuting constants of the motion. 


PROBLEMS 


17.1 In the notation of the text, show that J, is a constant of 
the motion without using the relation P, = Al,. 

17.2 Show that the two expressions in (17.3) are equal. 

17.3 With the notation of § 14, consider the operators 


Qn = 2 Dy*(T)T, 


where the summation is over all 7’ in the group, and ¢ is not summed. 
Show that Q,, commutes with any Q,,, any P, (17.3), and with the 
Hamiltonian. What is the effect of Q,, on an eigenfunction y,'#") 
of the Hamiltonian? Hence discuss the use of the Qu,» With or 
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without the P;, as a complete set of commuting constants of the 
motion. 

17.4 An electron is moving in a potential whose maximum 
symmetry is the point-group 32. Set up the minimum number of 
commuting operators forming a complete set so that a simultaneous 
eigenfunction is uniquely determined by its set of eigenvalues. 
Give the operators and eigenvalues in detail. 


Chapter IV 
FURTHER ASPECTS OF THE THEORY 
OF FREE ATOMS AND IONS 


In Chapter II we outlined the quantum theory of free atoms, 
the main purpose there being to illustrate the use of group theory 
in quantum mechanics. In the present chapter we shall apply 
group theory to some more specialized aspects of the theory of 
atoms, and then in the remaining chapters to a selection of topics 
from other branches of physics. 


18. Paramagnetic Ions in Crystalline Fields 


Introduction 


Most atoms when they form ions, such as in a crystalline salt, 
achieve a closed shell configuration either by losing their valence 
electrons (metallic ions) or building up an incomplete shell (negative 
ions). Such closed shell ions are therefore in a 4S state, which is 
diamagnetic, non-degenerate and generally does not give rise to 
interesting effects. However, the transition and rare earth metals 
are exceptions to this rule because of their inner, incomplete 3d, 
4d, 5d, 4f, 5f shells. As free ions these in general have paramagnetic, 
degenerate ground states which in crystals get split by the electric 
fields. In a crystalline salt, such an ion finds itself surrounded by 
some regular arrangement of other ions and water molecules. 
Now strictly one should consider the ion and its neighbours as a 
big molecule, and discuss their interaction in terms of covalent 
and other bonds, etc. However it has been found a good approxi- 
mation to consider the neighbours as simply giving rise to an 
electrostatic potential, the crystalline field, which acts on the para- 
magnetic ion. This field, combined with the spin-orbit coupling, 
splits the ground term into a sequence of levels. Since 1946 para- 
magnetic resonance has been used to study in great detail the lowest 
of these energy levels and their variation in an applied magnetic 
field. Consequently there is considerable interest in making accurate 
calculations of these levels. Comparison between the calculated 
and observed levels has then determined the magnitude of various 
parameters such as the strength of the crystalline field, and has 
also brought to light some refinements that are required in the 
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quantum theory of atoms. Reviews of the subject have been given 
by Bleaney and Stevens (1953) and by Bowers and Owen (1955). 

In Chapter II, the use of symmetry properties led to very broad 
general results, applicable to any free atom or ion, but in this 
section we shall use group theory in quite a different role. Because 
of the different ground states of the paramagnetic ions and the 
different crystal structures, each salt almost has to be discussed 
separately. Group theory is used to help calculate particular 
matrix elements and splittings as required. In the present section 
we shall therefore not attempt at any general presentation but 
consider two particular salts, cerium ethylsulphate and chromous 
sulphate, which suffice to illustrate most of the ideas. The energy 
levels are determined by setting up the Hamiltonian matrix and 
solving the secular equation, often using perturbation theory 
(Schiff 1955, p. 128). In detail this involves a long and complicated 
calculation, so that we shall only show in outline how group theory 
is used to calculate the type of matrix elements that are required 
during the calculation, and what the kind of splitting is at each 
stage. 


The crystalline field in cerium ethylsulphate 


The composition of this salt is Ce(C,H,SO,),;.9H,O, and it 
crystallizes in a hexagonal form with two molecules per unit cell. 
The two cerium ions are situated at equivalent special points in the 
cell lying on a three-fold rotation axis, with the water molecules 
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Fie. 10. A diagram of one cerium ethyl sulphate molecule 
Ce(C,H,;SO,);.9H,O. The full circles represent one water molecule 
each in the plane of the paper, and the broken circles each two water 
molecules with one above and one below the plane of the paper. 
The Ce+++ and C,H,SO,~ ions are also in the middle plane. 


ens, 


and ethylsulphate ions arranged round them as shown in Fig. 10. 
The cerium ion and water molecules have symmetry 6m2, but the 
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arrangement of atoms in the ethylsulphate radical does not have a 
vertical mirror plane, so that the overall symmetry is 6. Let us 
consider the potential energy V-(r) of an electron belonging to the 
cerium ion, due to the electrostatic field of all the water molecules 
and other ions. Since the charge density of the other ions is zero 
over the region of the Cet++ ion, we can write V, in terms of spherical 
harmonicst Y;", 

Ve =e Amr VY ™(8, ). (18.1) 

7 ™ 


Since R(d, z)Y¥;" = em¢Y,™, we must have 
A,™ = 0 unless m = 0, +3, +6,... (18.2) 


for V,_ to have three-fold symmetry. Since the group 6 contains 
the mirror plane mz, (18.1) must contain only even powers of z, 


and we have 
A,™ = 0 unless 1 — m = even. (18.3) 


The cerium ion has the configuration 4f, and if we neglect mixing 
with other configurations having some 40,000 cm higher energy, 
all matrix elements of V; involve the integrals 


f past Yimday dr (18.4) 


where ¢,, is a 4f orbital. Since the ¢,,’s transform according to 
D‘®) under rotations, the Y,"¢4, transform according to 


> D#, |t—3| $j S143; 


and from the fundamental theorem (13.8b), the integral (18.4) 
is zero unless the values of j include j = 3, i.e. unless / < 6. Similarly 
Y,™ must have even parity, i.e. we have 1 = even. It follows from 
these two restrictions and (18.2), (18.3) that we need only retain 
the constants A,°, A,°, A,°, A,®, A,g®, Ag—*, with the condition 
A,-* = A,§* so that (18.1) is real. The origin of the ¢ co-ordinate 
can now be chosen to make A,® real, and expressing the r!Y;™ 
in terms of 2, y, z, we have 


Ve = By? + B32? — r?) + By(3524 — 30r?2? + 3rt) + 
+ B,(2312z° — 315r2z4 + 105r4z? — 5r8) + 
+ B,8(x® — 15aty? + lda*%y4 — y$), (18.5) 
where the normalization constants have been absorbed into the 
B’s. Because we are restricting ourselves to the lowest configuration, 


+ In this section we write the spherical harmonics Y;™ and other quantities 
with m as a superscript instead of a subscript as in § 8, in accordance with 
customary usage in this particular subject. 
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(18.5) actually has the higher symmetry 6m2, rather than the 
required 6. This 6m2 is therefore the symmetry of the effective 
Hamiltonian for the cerium ion. 


Energy level splittings in cerium ethylsulphate 


The lowest configuration of Ce+++, namely 4f, contains only one 
term ?F, and in the free ion this is split by the spin orbit coupling 
into two levels J = 5/2 and J = 7/2. By Hund’s rule (§ 12) the 
J = 5/2 level lies lowest. The separation between the levels is 
about 2000 cm-1, whereas the splittings produced by V- are about 
200 cm-, so that it is a fair approximation to treat V, as a pertur- 
bation. We shall therefore consider the splitting of the J = 5/2 
level caused by V, using first order perturbation theory. A more 
accurate theory is possible, and indeed necessary to interpret the 
observed data completely: this involves considering the spin- 
orbit coupling and V, simultaneously, and solving the secular 
equation for the whole 14 states together without using perturbation 
theory (Elliott and Stevens 1952). Considering the perturbation 
(18.5), we note that B,° shifts all levels equally so that we shall 
neglect it. Also the B,° and B,® give only zero matrix ele- 
ments among the J = 5/2 manifold of states, because the products 
Y_™(J = 5/2, Mz) transform.according to 


> DD, J = 7/2, 9/2, ..., 17/2, 


which does not contain J = 5/2. We are therefore left to consider 
the perturbation 
¥ Vielrs) = > (Ver) + Vs] (18.6) 
t a 
to the Hamiltonian, where the summation is over all the electrons 
of the Ce**+* ion, and where 


2° = B,(32? — r2), (18.7) 
V ,° = B,(35z4 — 30r2z? + 3r4). (18.8) 


The perturbation (18.6) is invariant under rotations about the z- 
axis, so that the quantum number My, = M can still be used to 
label the states. Also (18.6) is invariant under a two-fold rotation 
about the x-axis, so that the states (1), ¥(—) have the same 
energy (cf. § 16 and appendix L). The relative energies of the three 
doublets are therefore 


Ir 
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where we have used the Dirac notation (13.9) for the matrix elements. 
Since the Ce+++ ion has an odd number of electrons, Kramers’ 
theorem (§ 19) shows that the splitting into doublets is the greatest 
splitting that can occur, and higher order effects from (18.5) can 
only shift the levels about. 

Let us consider the first term in (18.6). Since > 32? — r;2 and 
3J, — J* transform} in the same way under rotations, their 
matrix elements are proportional inside a given irreducible manifold 
of states by the argument of § 13 and 


(M5 V,9|M’> = (M|o[3J.2 — J(J + 1)]|M’>. (18.10) 
Hence the contribution of V,° to the energies (18.9) becomes 
—8o(M = +4), —2a(M = +8), 100(M = +8). (18.11) 


It remains to evaluate «. Let (mj, m,) be a single determinant wave 
function for all the electrons in a Cet+* ion, where m ;, m, denote 
the quantum numbers of the single 4f electron outside closed shells. 
We shall make the convention that when we use a single quantum 
number it refers to My, and a pair of numbers to m;, ms. The 
functions (mj, ms) transform according to D® x D@/®) under 
rotations, and we can pick out the linear combination 


where a and b are Wigner coefficients as in (9.7). They are (appen- 
dix I) a = 4/(6/7), 6 = —+/(1/7). Thus 


</> Ve°ld> = a7<3, —$|> V2°|3, —$> + 67<2, 4|> V2, 4. 
(18.12) 


Since 3z2 — r? and 31,2 — (1 + 1) transform in the same way under 
rotations, we have 
<3, —312 V2°|3, —4> = <3, —4/ BD [Bles? — (le + 1)]|3, —4> = 198, 
<2, 4/2 Vo°l2, 4> = <2, 4/8 > [Blaa? — L(t + 1)]|2, 4> =0, 
(18.13) 
where the closed shells do not contribute (problem 13.12). Further if 


$(3, —$) = Fy,(r) sin? 0 e?Fu_ 


{ Throughout this section we shall drop the factor h% from all angular momenta 


for the sake of custom and convenience. Thus we use J,,), , for I, 14D, orb, 1» Cte. 
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is the m; = 3, ms = —} orbital of the 4f electron, we have 


<3, —+4]> V,°(3, —) 
B,° [2 Fr! dr [7 sin® 6(3 cos? @ — 1) sin 6 dé 
fe Fr dr fj sin® 0 sin 6 d 6 


= —4B,"r7, 


(18.14) 


where Tr? is the expectation value of r? for any 4f state. Now com- 
bining (18.11), (18.12), (18.138) and (18.14), we have a = —(2/35) 
B,°r?. Similarly the matrix elements of V,° (18.8) may be cal- 
culated, and the final energy levels are shown in Table 13 (problems 


TABLE 13 
Energies and g-values of Lowest Doublets 


Doublet Energy 9s Ix» Ty 
M 
+4 —8a + 2y $ a 
+# —2a — 3y ee 0 
+¢ 10a + y a9. 0 


a = —(2/35)B,°r* and y = (8/21)B,°rt. 


18.1, 18.2). The method we have used to calculate such elements 
by replacing x, y, z by Jz, Jy, Jz or lz, ly, lz, is called the method of 
operator equivalents. It has been expounded in detail by Stevens 
(1952), who also gives some useful tables. 


Cerium ethylsulphate g-values 


We consider now the effect of a small magnetic field H on the 
energy levels. The perturbation to the Hamiltonian is 


6p H-(L + 28) (18.15) 


where § is the Bohr magneton ef/2mc (Schiff 1955, p. 293). This 
usually splits each doublet, and the splitting 47 is usually expressed 
in the form 

AE = g(M)pH, (18.16) 


where g is a factor which we shall now calculate. The operators 
L, S and J all transform as vectors under rotations, so that the 
matrix elements of their components are proportional to one 
another inside an irreducible manifold of states belonging to one 
value of J. Thus the perturbation (18.15) can be written g,BH- J 
where gz = 6/7 is the Landé splitting factor calculated in problem 
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13.6. With H in the z-direction, each energy level is increased by 
gt8HM, whence comparison with (18.16) gives 


gz(M) = 2971|M|. 
For H along the x-axis, we have 
(+£8/2|J2|+3/2> = ¢£5/2|J2|45/2> = 0 


and no first order splitting of the M = 3/2 and M = 5/2 doublets. 
From (8.18) we have 


<—4)Ja2lh> = 4-4/7, +d_|b, 
= }JVJ +1) — M(M — 1)p with M = 4, 


Thus for the +4 doublet the matrix of the perturbation (18.15) is 


9 ead 

gee F | 
and the energy levels are +-(9/7)BH, whence gz = 18/7. From 
symmetry, gy = gz, and we have the numerical values shown in 
Table 13. 

Paramagnetic resonance experiments at low temperatures show 

that in cerium ethylsulphate the lowest doublet has g, = 0-955, 
9x = Jy = 2-185, and that there is another doublet about 3 cm-} 
higher with gz = 3-72, gz = gy = 0-2. Comparison with the g-values 
‘of Table 13 shows that we must identify these levels with the 
M = +} and M = +5/2 doublets respectively. The discrepancies 
between the calculated and observed g-values are due to the per- 
turbing influence of the J = 7/2 level some 2000 cm-! higher and 
the B,°, B® terms in (18.5). Magnetic susceptibility measurements 
indicate the presence of another level about 130 cm-! above the 
two low lying doublets, and this must be the M = +3/2 level. 
Comparing these figures with the relative energies of the levels 
shown in Table 13, we obtain 


B,°r? = 25 em-}, B,°rt =: —74 cm-}, 


These values are again rather rough, but they illustrate the kind of 
way in which the magnitudes of the crystal field parameters can be 
obtained from experiment. Elliott and Stevens (1952) have given 
a much better fit to the data, and determined the values of all the 
constants in (18.5) (except By°). In particular B,* turns out to be 
large and important, as we might expect from the fact that it alone 
reflects the trigonal arrangement of the water molecules, all the 
other terms being axially symmetrical. 
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Crystal field splittings in chromous sulphate 


This salt CrSQ,.5H,O appears to have the same structure as 
copper sulphate. The Crt+ ion is at the centre of a square of four 
water molecules with two oxygen ions centrally located above and 
below the square. The six oxygen atoms form approximately 
a regular octahedron, and the crystalline field consists predominantly 
of a large cubic component with symmetry m3m, and a smaller 
tetragonal component with symmetry 4/mmm. In addition a 
distortion of the square of water molecules gives a still smaller 
orthorhombic (usually abbreviated to “rhombic”’) field with sym- 
metry mmm, whose effect is comparable with that of the spin- 
orbit coupling. We shall now discuss qualitatively the energy level 
splittings on the basis of these symmetries and orders of magnitudes. 

The chromous ion Cr++ is in the configuration (3d)* whose lowest 
term is 5D (§12). Under the influence of the cubic field, the Hamil- 
tonian is invariant under the group of rotations m3m of the orbital 
variables, and under all spin rotations. The 2S + 1 = 5-fold spin 
degeneracy of each orbital level therefore remains. The unperturbed 
orbital functions transform according to D‘®) under rotations, and 
this is reducible into the representations HZ, and 7, of the cubic 
group (Table 14). The characters of this representation with respect 


TaBLeE 14 
Irreducible Representations for Splitting of 5D Term 


Cubic field, point-group m3m. D(?) (rotation group) = Eg + Fag. 


x(2) x(3) x(2 2) x(2a) x(42)—s parity 
rotation 
group i § —1 1 1 -—1 even 
D() ‘ 
Ey 2 —1 2 0 0 even 
Tse 3 0 —l 1 —l even 
Tetragonal field, point-group 4/mmm. E,(cubic group) = Ayg + Big. 

xiZ) x22) x(2z) xa) x(4e) parity 
cubic ; 
group 2 2 2 9 0 even 
Ey 
Ay 1 1 1 1 1 even 
By 1 1 1 —1 —1 even 


Rhombic field, point-group mmm. 

Orbital function: B,g (tetragonal group) = Ag (rhombic). 

Spin functions: D(@®) = Ay + Ay + By + Bag + Bag. 

Complete functions: Ay x DE) = Ag ++ Ay + By + Bag + Bog. 
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to the group m3m are obtained from (14.4), the irreducible re- 
presentations of m3m from appendix K, and the reduction of the 
representation from (14.2). The L = 2 term is therefore split into 
an orbital doublet and an orbital triplet, and it so happens that the 
doublet lies lowest (Fig. 11). Under the action of the tetragonal 
field the lower doublet splits into two orbital singlet states trans- 
forming according to the representations A,, and B,, of the group 
4/mmm (Table 14). We shall suppose that the B,, level lies lowest 
(Fig. 11). The states of this level transform under orbital rotations 


3x5 < 
Tog 
5x5 
5D 
1x5 
ax5 / Aw 
Es x5 =i} 2A, + By 
B,, || +8,, +B; 
Free ion Cubic Tetragonai Rhombic field ond 
field field spin orbit coupling 


Fia. 11. Splitting of 5D term by crystalline fields. The degeneracy 
and symmetry of each level is indicated. 


according to the representation B,, of the group 4/mmm, which 
becomes the representation A, of the group mmm. Under spin 
rotations the states transform according to D'S) with S = 2, and 
under orbital and spin rotations together according to Ay x D®), 
which is reducible into five one-dimensional representations (Table 
14). Thus under the combined influence of the rhombic field and 
spin-orbit coupling, we finally obtain five non-degenerate levels 
(Fig. 11). 


The spin-Hamiltonian for chromous sulphate 


The splittings produced by the cubic and tetragonal fields can 
easily be calculated in the same general way as the energies in 
Table 13, using the method of operator equivalents as discussed 
by Stevens (1952). We shall only concern ourselves here with the 
relative energies of the five lowest levels shown in Fig. 11, because 
it is these energies that can be studied experimentally as functions 
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of the strength and direction of an applied magnetic field. Now it 
is not usually possible to give analytic expressions for these energies 
as functions of the field, so that instead the theoretical and experi- 
mental results are conveniently interpreted in terms of a 5 x 5 
matrix whose eigenvalues are the required energy values (Schiff 
1955, p. 128). This matrix, which is called the spin-Hamiltonian, 
we shall now proceed to calculate. 

We start with the 1 x 5-fold degenerate level transforming 
according to By, x D‘ under orbital and spin rotations of the 
group 4/mmm (Fig. 11). We apply the rhombic field, the spin- 
orbit coupling and the magnetic field as a perturbation. Since all 
three produce effects of the same order of magnitude (1 em-), 
they must be treated together as a single perturbation. To be 
precise, the rhombic field and AL -S produce no splitting of the 
level in a first order of approximation, but they are intrinsically 
large enough for their second and higher order terms to be com- 
parable with the first order magnetic field splittings. All other 
levels in the tetragonal field approximation lie considerably higher, 
so that perturbation theory can be used. The rhombic field can be 
expanded in terms of spherical harmonics (18.1), and we shall 
consider only the typical term 


Vin = > A(x? — yi’). 
r 


The spin-orbit coupling takes the form AL +S (13.20), since we 
shall neglect the effect of all levels not arising from the *D term. 


With the magnetic interaction (18.15) the whole perturbation 
becomes 


Hy = Vr, + AL+S + BH: (L 4+ 28). (18.17) 


We shall denote the unperturbed states, i.e. the states in the 
tetragonal field approximation of Fig. 11, by two quantum numbers 
n, M. Here n denotes the orbital state, numbered from the lowest 
one (B,,) as zero, and M is the spin quantum number Ms, —2 < 

S 2. As a further piece of notation the matrix elements of some 
operators can be written in a simplified form. For example in the 
operator L,S,, Lz operates only on orbital variables and S, on 
spin variables. Thus (problem 18.5) 


«nM | L,Sz|n'M’> 
= <n" |Lz|n'M") <n" |Sz|n"M'> 
= <n|Lz|n")><M|8z|M’), 


(18.18) 
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where in the last line we have dropped the quantum numbers 
n”, M” since the matrix elements are independent of them. We 
shall follow this convention throughout. 

We now apply the perturbation (18.17) to first order, the energy 
levels being obtained from the 5 x 5 matrix 


{OM | Hp | OM’). (18.19) 


Since V,, is spin-independent, it gives a constant diagonal con- 
tribution to (18.19) which shifts all energies equally. This we shall 
neglect. The spin-orbit coupling gives 


A > <O}L,|0><M |S; |. 
L,Y .2 


This is zero because <0|Z;|0> is zero, as we shall now prove. Suppose 
that 4 = dornUspin, or an antisymmetric linear combination of such 
functions, satisfies a spin-independent Schrédinger equation as 
our unperturbed states do. In the absence of magnetic fields, all 
terms in the Hamiltonian are real. Thus if ¢dornUspin satisfies the 
Schrédinger equation, then so does ¢orp*Uspin. Now our ground 
state is orbitally non-degenerate, so that we must have ¢* = a¢ 
where « is some constant. By incorporating in ¢ a suitable phase 
factor, we can choose ¢ real. Hence 


; a a 
<0|Lz|0> = th { (25 + vz) dr =, (18.20) 
where a@ is real. But from (8.17a), we have 
<O[Le|0>* = [f gttep do]* = [f (L)*$ do]* 


= | #*Ulp) dr = <0|Ze|0), 


and hence <0|Z,/0> is real. Comparing this with (18.20), we 
conclude 
<0|L,|0> = 0, t= 2, Y, 2. (18.21) 


We next calculate the magnetic term in (18.19). The contribution 
from BH - L is zero because of (18.21), and we are left with 


HY = > 2BH KM |S;|M’D. (18.22) 
q 


If H is in the z-direction, (18.22) is diagonal and the energy levels 
to this order of approximation are 


Ey = 2pHM. (18.23) 
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Now a system with only orbital angular momentum has magnetic 
energy levels BHM, and gives a contribution to the susceptibility 
the same as that of a classical magnetic moment of strength 
p =[L(L + 1)}"8. Similarly a free ion with spin has a magnetic 


moment 
w= grlJ(J + 1))"8, (18.24) 


where gy is the Landé splitting factor (problem 13.6). However 
in the crystalline field, from (18.23) the Crt+ ion has a magnetic 


moment of 
pw = ASS + PB, (18.25) 


as if only the spin and not the orbital angular momentum contri- 
buted to the magnetic moment. This situation is described by 
saying “the orbital moment is quenched’”’. Table 15 compares the 
TABLE 15 
Magnetic Moments of Paramagnetic Ions 


Config- Lowest p(cale) p(calc) 


Ton uration level free ion spinonly (exp) 
Titt++, Vat 3d 2Dsje 1-55 1-73 1-8 
Vttt 3d2 8K, 1-63 2°83 2-8 
Crt++, Vtt 3d3 “Peis 0-77 3°87 3:8 
Mntt++, Crt+ 3d 5Do 0-00 4-90 4-9 
Fet++, Mn++ 3d5 ®S5/a 5:92 5-92 5-9 
Fett 3dé 5D, 6-70 4-90 5-4 
Cott 3d? {Foe 6°63 3°87 4-8 
Nit+ 3ds 3K, 5°59 2°83 3:2 
Cutt 3d 2Dsia 3°55 1-73 1-9 


The values of p» are given in units of the Bohr magneton B = ehi/2me. The 
free-ion values are calculated from (18.24) and the spin-only ones from (18.25). 
The experimental values are for representative salts (Kittel 1956). 


observed magnetic moments of transition metal ions in salts with 
the calculated free-ion value (18.24) and the spin-only value (18.25). 
The quenching of the orbital moment is seen to be a general pheno- 
menon for such ions, and the physical reason for it is as follows. 
In a free ion the states of different 1; have the same energy, and 
in a magnetic field the atom tries to line up its magnetic moment 
with the field and hence to take up the state with minimum J. 
However, in a crystal this is impossible, because the states with 
different MU, have quite different energies. Their charge distributions 
differ, and there is one lowest orbital state whose charge distribution 
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avoids as much as possible the electrons from the nearest neigh- 
bouring atoms. In this way the Coulomb repulsion between the 
two sets of electrons is lowest. The ion is therefore not free to take 
up the state of minimum My, and the orbital angular momentum 
gives no contribution to the magnetic moment in a first order of 
approximation. 

We continue with the perturbation calculation of (18.17). Since 
the lowest five unperturbed states ¥(0, M) are degenerate, their 
energies have to be obtained by first calculating a 5 x 5 matrix 


He =H 4HSD4 AL... (18.26) 


where #,")), #,'”) etc. are obtained from first order, second order, 
etc., perturbation theory. This matrix is called the spin-Hamil- 
tonian, and the energies of the states are its eigenvalues (Schiff 
1955, p. 158; Pryce 1950). 4; has already been calculated 
(18.22). We shall continue to express the whole of #, like #,‘) 
in terms of the matrices (M|S;|M'>, i =z, y, z. These matrices 
can easily be calculated from (18.18): for instance (<M |Sz|M’'> is 


1 
1 a, ae 
. ee . eo CS. (18.27) 


Also for the sake of simplicity we shall from now on use Sz for the 
matrix <M|S;|M'> like (18.27), as well as for the more general 
spin operator Sz; = TIz pi, With this notation (18.22) can be 
written 

H')) = 28H -S = 28(H,S, + H,S, + HS,).  (18.22’) 


The second order contribution to the spin-Hamiltonian is 


HA) = — S (OM | |nm><nm|#p|0M"> 
n,m 


Ey — Ky 


(18.28) 


On substituting (18.17) for # and multiplying out the numerator, 
we obtain a number of terms, one of which is 


AL : 
= > SOM ALS, |nm) (nm |BH2L2|0M"> (18.292) 
n,m pane 


This can be expressed in the spin-Hamiltonian form 
$49-BHS, (18.29b) 
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L 2 
where --2 > Loli ein | (18.29¢) 
n—_ 0 


This term and #,') (18.22’) can therefore be collected together and 
written 


Bz 2Sz + gyHySy + gzH2Sz) (18.30) 
where 9 = 2 + Ag. 


Since we have not used V,, so far, the system has tetragonal sym- 
metry and gz = gy. Further, for an incomplete shell less than half 
full of electrons, it can easily be shown from (11.8), (13.16) that 2 
is positive in accordance with Hund’s rule (§ 12). Thus from (18.29¢c) 
and. (18.30), the values of g; tend to be less than 2. For instance in 
chromous sulphate gz = 1-95. For the same reason the observed 
magnetic moments of ions with less than a half full 3d shell tend to 
be smaller than the spin-only values (18.25). On the other hand a 
shell more than half full behaves like a few positive holes, which 
changes the sign of A. This explains why in the lower half of Table 
15 the observed magnetic moments are larger than the spin-only 
values. Furthermore selection rules for <0|Z;4|n> show that there 
is no contribution to (18.29), (18.30) from the level n = 1, so that 
the energy denominator HL, — E, is at least the large splitting 
produced by the cubic field (Fig. 11). Thus the 4g; in chromous 
sulphate are small, of the order of 0-05. 

Similarly, it follows that the term <|L-S|><|L-S|> in (18.28) 


contributes 
2 diz S4S; 


to the spin-Hamiltonian, where dy is a tensor. Because of the 
tetragonal symmetry of the unperturbed system, the tensor must 
have the form 


aSz? + aSy? + b8,2 = D[S22 — 48(S + 1)] + const. (18.31) 


There is a similar term Af$?H,? due to <|BH-L|><|BH-L|> in 
(18.28), which is very small and which we shall neglect. Further, 
selection rules show that all terms in (18.28) involving V,, are 
zero (problem 18.6). In fact it is only in the fourth order that terms 
involving V,,, like 


(OM |Vin|nm><nm|Vrn|n'm’> 
<n'm' |L-S|n"m"><n"m" |L-S|OM'> 
nm,n'm’ (En — Eo)(En — Eo)(En- — Eo) 
nm” 
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give non-zero contributions to the spin-Hamiltonian. Because of 
the rhombic symmetry, we expect some of these to have the form 
(18.31) and some the form H(Sz? — S,*). Similarly, there are very 
small contributions of the form (18.30) with gz 4 gy. Still higher 
orders of perturbation will give other rhombic components like 
S,4 — Sy* which are very small. Thus we may write the total 
spin-Hamiltonian approximately as 


Hs = D[Sz? — 4S(S + 1)] + H(Sz? — Sy?) + 
+ BgcH,Sz + GyHySy + 9:HS,). (18.32) 


The constants are not entirely independent of one another because 
of the way they depend on A, <0|L;/n>, etc. Ono et al. (1954) have 
calculated the energy levels on the basis of this Hamiltonian, and 
compared them with their observed paramagnetic resonance lines. 
From this they obtain the following values of the constants: 


|D| = 2-24 cm-}, |Z | = 0-10 cm-!, 
Gx Jy = 1-99, Iz = 1-95, 


which they show to be compatible with the expected values of A, 
the cubic field splitting, etc. 
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Summary 


We have shown how symmetry properties are used to determine 
the qualitative nature of the splittings due to crystalline electric 
fields in paramagnetic salts. In setting up a quantitative theory, 
it is necessary to calculate a large number of matrix elements of 
various kinds. Here we have shown by discussing two particular 
salts how symmetry properties can be useful in a wide variety of 
ways in calculating these matrix elements, so much so that they 
often enable one to set up a complete theory and derive a spin- 
Hamiltonian involving just a few constants which then have to be 
calculated in detail or found by experiment. 
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PRoBLEMS 


18.1 In writing down an operator equivalent for the potential 
(18.8) 3524 — 30r2z2 + 3r* analogously to (18.10), show that rz? 
must not be replaced by Jz2J(J + 1), but by the expression 


t> G20? +I JS iJodi tJ J 7d2 +JPde? + Sid dide + Jide7Ji). 
1=2,Y,2 
Using the commutation relations among the J;, show that the 
complete operator equivalent for (18.8) is 


8[35J24 — 830 (J + We? + 25), — 6J(J +1) + 8% + 1)2]. 


18.2 Calculate 5 in problem 18.1 for a Cet++ ion, and hence 
derive the V,° contribution to the energy levels of Table 13. 

18.3 Discuss the splitting of the upper level in Fig. 11 under a 
tetragonal field, and then under an additional rhombic field and 
spin-orbit coupling. 

18.4 Show that Lz, Ly, L, have zero matrix elements between the 
levels marked Ay, and By,g in the tetragonal field approximation in 
Fig. 11. 

18.5 Prove the first step in equation (18.18) analytically. 

18.6 Show that the second order terms (18.28) which involve 
Vrn are all zero or a constant. 

18.7 Calculate the eigenvalues of the spin-Hamiltonian (18.32) 
when the magnetic field is zero. Associate each level with the 
correct irreducible representation (Table 14) of the group mmm. 

18.8 Calculate the selection rules for magnetic dipole transitions 
(problem 13.2) among states transforming under the various irreduc- 
ible representations of the group mmm. 

18.9 A sample of chromous sulphate is placed in a microwave 
cavity such that the oscillating magnetic field is in the z-direction, 
referred to the same axes as the spin-Hamiltonian (18.32). What 
magnetic dipole transitions are allowed among the levels (a) in zero 
magnetic field, (b) when the steady applied magnetic field is in the 
z-direction, and (c) when the applied field is in the y-direction? 

18.10* Outline the theory of the paramagnetic resonance spectra 
of hydrated cobalt salts (Abragam and Pryce 1951b). 

18.11 In a dilute alloy of cobalt in copper, three cobalt atoms 
are arranged as a cluster in a straight line at the nearest neighbour 
distance apart. Assume that they have a spin of one, and that their 
interaction energy is 


H# =—aS,-S, + aS,-S, + 68, - Ss, 
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where the S; are the spins of the three atoms with atom 2 being 
the central one, and a is positive (antiferromagnetic coupling). 
Show that the ground state of the cluster may be expected to have 
S =0 or 1 for physical reasons. Hence calculate the ground state 
energy by writing down the wave functions using Wigner coefficients 
(§ 9) and all the symmetries of #. Discuss how you would do the 
calculation if # contained higher order terms like (S, - S,)? + 
(S, - S,)? or anisotropic exchange (Siz + S3z)S2z. 


19. Time-Reversal and Kramers’ Theorem 


Introduction 


In addition to the types of symmetry transformation listed in 
§3 and studied so far, almost all} quantum mechanical systems 
have an additional symmetry of a rather different nature, called 
time-reversal. Consider the time-dependent Schrédinger equation 


, OV 
HOE = th Ot” 
If we apply the simple time-inversion substitution t - —t (denoted 
by 7), we obtain 
ov 
HE = —ih 2 
and we note that the Schrédinger equation is clearly not invariant 
under 7 because of the minus sign. However we can easily fix this 
up by taking the complex conjugate as well, and the combined 
operation of time-inversion and complex conjugation is called the 
time-reversal transformation T. We shall first exhibit some of its 
features in a simplified form by an example. 
An electron is moving, say in an atom, in a spherically symmetrical 
potential on which is superposed a uniform electric field @ in the 
z-direction. The time-independent Schrédinger equation is 


h? ; 
|- — V2 + V(r) + eb: bp = Exp. (19.1) 


The Hamiltonian is invariant under rotations about the z-axis, 
and we can sort out the eigenfunctions to transform according to 


t The only known exception is a system in an externally applied magnetic 
field. There is also reason to suspect that the nuclear interaction involved in 
beta decay (§ 33) may not be invariant under timo-reversal (Jackson et al., 
1957). 
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the representations exp(im¢) (7.3). Since we are considering only 
one electron, the wave function can in fact be written 


b =f (r, 8) exp(img). (19.2) 


These representations are all one-dimensional, and at first sight one 
might suppose that they all have different energies so that all 
levels are non-degenerate. However, this is not so, as can be seen 
as follows. We take the complex conjugate of (19.1), obtaining 


[- Bvt + Vin) + ede] p+ = By. (19.3) 


This shows that * is also an eigenfunction belonging to the same 
energy as %. Now (19.2) shows that * belongs to the representation 
with —m, so that if m + 0, 4 and 4* must be linearly independent. 

Thus all the levels with m ~ 0 are at least two-fold degenerate; 
- and when they are n-fold degenerate then » must be even. More 
generally, the wave function including spin for a many-electron 
atom satisfies a Schrédinger equation which is much more com- 
plicated than (19.1). However, the two-fold degeneracy between 
wavefunctions with quantum numbers M, and —M, remains. 
In particular for an odd number of electrons 


My, = half an odd integer + 0, 


so that all levels have even degeneracy. This is called the Kramers’ 
degeneracy. 

In the present example, the degeneracy between wave functions 
belonging to representations m and —m could have been proved 
using the reflection symmetry of (19.1) in the plane z = 0 (ef. 
for instance the character table for the group com, appendix L). 
The use of the time-reversal symmetry y — ¥* is not really necessary. 

Summarizing, we have illustrated in a vague kind of way the 
following essential aspects of time-reversal symmetry. 

(i) Time-reversal symmetry involves taking the complex con- 
jugate ¢ —> ~*. It is, therefore, not a simple transformation of co- 
ordinates as discussed in § 3. Consequently the representation 
theory of § 5 and appendix C do not apply, and the time-reversal 
symmetry has to be considered from first principles. 

(ii) To prove the existence of a degeneracy due to time-reversal, 
we have to prove firstly that % and 7’ belong to the same energy 
level, where 7 is the time-reversal operator (19.5) below, and 
secondly that ¢ and Ty are linearly independent. In this way it 
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can be shown that an atom, with an odd number of electrons placed 
in any type of electric potential but in the absence of a magnetic 
field, has all energy levels n-fold degenerate where n is even. This 
is Kramers’ theorem. 

(iii) It is usually only necessary to consider time-reversal sym- 
metry explicitly for systems having a low spatial symmetry and 
having spin dependent wave functions. In other cases the degen- 
eracy produced by time-reversal is usually produced also by some 
spatial symmetry. These points will become clearer as we proceed 
with the detailed development. 


The time-reversal operator 
The time-dependent Schrédinger equation is 


(x — ih 3) w(t) =0. (19.4) 


Recapitulating briefly the initial argument, the operator in this 
equation is clearly not invariant under the simple time-inversion 
substitution ¢ >» —t. Thus if there is to be a time-reversal trans- 
formation, it must have a more complicated form, and we define 
the time-reversal operator J’ by the relation 


TYP(ry, Cz; t) ao P(r, C2t, —t). (19.5a) 


Here we have abbreviated all the electron co-ordinates r,, oz, 
Ye, Oze,---In, Ozn tO 4, Oz. 

This operator 7’ has some important properties. Firstly suppose 
Y is an energy eigenfunction, i.e. it has the form 


Wri, o2, t) = W(t4, ox) exp(—iEt/h). (19.6) 


In this section we shall always use ¥ and ¢ to denote time-dependent 
and time-independent wave functions respectively. We have 


TW = p*(ry, ox) exp(—iBt/h), (19.7) 


so that if TY is also an eigenfunction of the Hamiltonian, then it 
belongs to the same energy level. 
From (19.5) we have immediately the second property 


Tia, + bY,) = a* TW, + b*TY,, (19.8) 


Thus 7 is not a linear operator, and much of the representation 
theory of § 5 and appendix C does not hold, for instance the equations 
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leading to the equivalence relation (5.15). We shall just note one 
consequence of this to avoid confusion later. In § 7 it was shown 
that ordinarily a cyclic group has only one-dimensional irreducible 
representations, but it is shown below in proving Kramers’ theorem 
that the cyclic group Z, T, T? = —E, —T (cf. equation (19.13b)) 
has two-dimensional irreducible representations. Thus time-reversal 
can lead to degeneracy. 

In § 11 it was shown that the spin functions u,, w_ transform 
under rotations according to the representation D“/®) with matrices 
(8.24). Since these matrices are complex, the functions u,, u_ 
have been regarded quite properly as complex quantities. However 
this notion requires some care because it is possible to interpret 
the complex conjugates u,*, w_* in two different ways.t To obtain 
the first interpretation, we recall the definition (11.12) of u,, uw, 
and taking the complex conjugate of (11.12) we obtain w,*(1) 
= 1, u,*(—1) = 0, u_*(1) = 0, u_*(—1) = 1. Since wu,” is still a 
function of the variable oz, it must be expressible as a linear combina- 
tion of wu, and w_. Comparing with (11.12), we have 


u,* =U,, u* =uU_. (19.9a) 
This is the interpretation one uses in calculating matrix elements: 


for instance we have wu,*(l)u_(1) =0 =wu,*(—l)u(—1) and 
> u,*u_ = 0. More generally 


> u,*ug = B29 (19.9b) 


as in the footnote to equation (13.1). 
The second interpretation of u,.*, u_* is obtained from the trans- 
formation properties. The complex conjugate of (11.13) is 

Ru,* = a*u,* + b¥u_*, 

Ru_* = c¥u,* + d*u_*, (19.10) 
where F# is a real transformation of the real co-ordinate axes Ox, 
Oy, Oz. From the form (8.24) of the matrices of D“/2), we have 

a* = d, b* = —e, c* = —5d, d* —a, 
and R(u_*) = a(u_*) + 0(—u,*), 
R(—u,*) = e(u_*) + d(—u,*). 
{ This confusion arises because our discussion of spinors is somewhat 
inadequate, in particular because we have not distinguished between row and 
column spinors. In fact (19.9b) should be written 2Li,*ug = 6.8 using the 


row spinors é,*, and (19.10) is correct as it stands with the u,* being column 
spinors like u,. 


12 
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Thus wu_*, —u,* transform like u,, u_, as follows alternatively 
from problem 8.16. As before, w,*, u_* must be expressible as 
linear combinations of u,, u_, so that by Schurs’ lemma (problem 
D.2) u_*, —u,* are proportional to u,, u_. We may choose the 
arbitrary phase factor as unity, and obtain 


“u*=—u, wt =4,. (19.11) 


It is this latter interpretation that is required in connection with 
(19.5a), so to be explicit we shall complete the definition of time- 
reversal (19.5a) by the relations 


(19.5b) 


For instance for one electron we have 


Tf (rus, + f(r)u_] = f_*(r)u, —f,*(r)u_, 


and for a many-electron function (11.11b) we obtain 


Thorp (r1, r2, 2.6 -)Ugitige eee 
= Worn (Pr, Va, . « -)(Tuay)(Tipe) ... (19.12) 
For an ordinary function f(r) we have (f*)* =f, but note that 


from (19.11) (w,*)* = —u,, (u_*)* = —u_. It follows then from 
(19.12) that 


T?Y —W ‘for aneven number of electrons, | (19.13a) 
= —W for an odd number of electrons. | (19.13b) 


Transformation of the Hamiltonian 
Operating on the Schrédinger equation (19.4) with 7’, we obtain 
from (19.5a), (19.8) 
0 = T(# — iho/a)Y 
= THTOTY — (—th)0/0(—-) T¥ 
= (THT — tho/a)TY?, (19.14) 
where T#’T-! is the transformed Hamiltonian (cf. footnote to 
equation (5.4)). Since (19.5) does not state how to calculate T#T-1, 
this product is defined in terms of its effect on an arbitrary function. 
Let ¢ be arbitrary; then 7'¢ is also arbitrary. Suppose 7 = #, 
+1#;,, where #, and #; are real operators. By a real operator 
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we mean an operator which operating on a real function gives a 
real function.| Then 


[Ty + iH))TIT$ = TH + Til 
= H,Th —i#,TS 
= (#7, —1#;)T4, 


as follows as an extension of (19.8) or alternatively by writing 
¢ = dr + i¢;. Thus we have 


THT =H#*, 


i.e. we obtain the time-reversed operator T#’T- by replacing every 
operator in # by its complex conjugate. For instance a potential 
V(r) is real and remains unchanged. The momentum pz = —thd/dx 
is pure imaginary and changes sign, p—> —p. Also the angular 
momentum operators change sign. For the orbital angular momen- 
tum this follows from the definition 1 =r (A p, but for the spin 
angular momentum it is necessary to go back to its definition 
(11.6) as follows. A rotation FR is a real operator,{ and it follows 
from (8.1) that iJ, is real, and that all angular momenta fl, are 
pure imaginary operators. Since # is a real function of r, p, 8, we 


have . 
THT =4* =H (vi, —Pi, —Si). (19.15) 


Kramers’ theorem 


This theorem states: in the presence of any electric potential 
but in the absence of an external magnetic field, every energy level 
of a system with an odd number of electrons is n-fold degenerate, 
where » is an even number (not necessarily the same for each level). 

We first note that in the absence of an external magnetic field, 
the Hamiltonian contains only even powers of the momenta. This is 


} This notation differs from the one used for instance by Dirac (1958) who 
uses “‘real’” to describe an operator with all real eigenvalues. 

{ This is obvious as regards its action on an ordinary function f (x, y, z) 
because it is a real transformation of the real co-ordinates x, y, z. It is also 
true with respect to its effect on spin functions, for consider it operating on the 
real function (uw, + u,*): 

Rus + u*) = (au, + atu,*) + (bu_ + b¥u_*) 
= real. 
Incidentally, note that in deriving this result we have used (19.10), which 
explains why we use the definition (19.11) of complex conjugation derived 
from (19.10), rather than using (19.9). 
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so for the kinetic energy p?/2m and for all the spin-orbit and spin- 
spin coupling terms of (11.8), (11.9) like l;-s;, rz; A py- Si, S¢° Sy, 
etc. Thus 

HA (ti, Ds, 81) = H(t, —Pi, — 8), (19.16) 


or from (19.15) 
THT! =H#. (19.17) 


In the presence of an external magnetic field H, the Hamiltonian 
contains the term (e/2mc)H-(L + 2S) (Schiff 1955, p. 292) which is 
linear in the angular momenta, so that (19.16) does not hold. This 
shows why the theorem must be restricted to systems not in an 
external magnetic field. Note, however, that interactions such as 
the spin-orbit coupling which depend on the internal magnetic 
fields of the system generated by the moving electrons are invariant 
under time-reversal, since these internal fields change sign if the 
momenta of all the particles are reversed. This is all completely 
analogous to the situation in classical mechanics (problem 19.4). 

It follows from (19.17) that if #7 = Hy, then Ty = ETY, 
so that 4 and 7 are eigenfunctions belonging to the same energy 
level. For this to give a degeneracy we have to show that they are 
linearly independent. Suppose 


Tis = op (19.18) 
where « is some constant. Then 
Tb = Tors = a* Ty = a* op. 


For a system with an odd number of electrons, this gives a contra- 
diction to 177s = —y (19.13b) since «*a is positive and cannot 
possibly equal -1. Thus (19.18) is false, and % and 7’ are linearly 
independent. Since 7% = —7, the degeneracy of every energy 
level is even, which proves the theorem. 


Energy level degeneracies 


Let us consider an atom in a J = 3/2 state placed in an electric 
field with point-group symmetry 32 due to its neighbours in a crystal 
or molecule, and let us determine the possible splitting of the level 
in the manner of §§6 and 14. The J = 3/2 states transform according 
to a double-valued representation I under rotations, and their 
characters for the double-group of 32 are (§ 16 and equation (14.4)) 


x(E) _x(#) xB) x(A)—xX(K)— xR) 
4 -4 -1 1 0 0 
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From Table 12 we have 


P=C,+T,+T,, (19.19) 


so that we might expect the level to split into two singlets (I",, I’,) 
and one doublet (I). However we have not considered the full 
symmetry group of the Hamiltonian because this also includes time 
reversal, and Kramers’ theorem shows that the greatest splitting 
possible is into doublets. Hence in this example time-reversal 
leads to a degeneracy, at least between I’, and I’,, in addition to the 
degeneracies (I°,) due to rotational and other symmetries. 

We shall now investigate the question of degeneracy more 
generally, and consider a degenerate set of states ¥; transforming 
according to an irreducible representation D under some group 
© of rotational or other symmetry transformations R. 


Ryy = Dy Ry (19.20) 


There are three cases to consider. 

(a) The representation Dj(R) is real, or can be made completely 
real by an equivalence transformation (5.15) of base vectors. 

(b) The representations Dy{R) and Dy*(R) are not equivalent. 

(c) The representations Dj(R) and Dy*(R) are equivalent, but 
they cannot be made completely real by an equivalence transforma- 
tion (5.15) of base vectors. 


We have to subdivide these further and discuss separately systems 
with even and odd numbers of electrons. We also assume that the 
Hamiltonian is invariant under time-reversal, and shall determine 
in which cases this leads to some degeneracy in addition to that 
expected from the symmetry group & alone. 

Consider first a system with an even number of electrons. Case 
(a). Suppose the yy have already been so chosen that Dy(R) is 
real. Then 


RT Wy = Dy*(R)T hy = Dy(R)T Hr, (19.21) 


and the functions ¢; = ¥, + Ty also transform under & according 
to Dy(R). Further from (19.13a), 7¢; = ¢; so that the set ¢; 
transforms into itself under all symmetry transformations, both 
those of © and time-reversal. Thus there are no symmetry pro- 
perties connecting the set ¢; with any other wave functions, and 
hence there is no additional degeneracy (§ 6, Theorem 2). An example 
of this case is the representation D of the rotation group when Jis an 
integer. The functions Yim + (—1)™Yi,_mandi[Yim — (—1)™ Yi _m] 
are real base vectors transforming according to D‘ so that the 
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matrices of the representation are all real. Here the Yim are 
spherical harmonics transforming in the standard way (8.18). 
Other examples are the single-valued representations of the group 
32. Case (b). The 7; transform according to Diy*({R) which is not 
equivalent to Djj(R), so that the 4; and 7; are linearly independent 
(lemma 5, appendix C). The representations Dy(R) and Dy*(R) 
therefore always occur together as a pair. Since # is invariant 
under 7’, they belong to the same energy level and we have an 
additional degeneracy. There are several examples of this among 
the representations of the crystal point-groups (appendix K), for 
instance the two complex representations of the group 4 which are 
bracketed together and labelled E. Since the two representations 
are always degenerate when there is time-reversal symmetry, 
they behave for elementary purposes as a single doubly-degenerate 
irreducible representation. This is the reason for bracketing them 
together and labelling the pair with a single letter. Case (c). In 
this case time-reversal leads to additional degeneracy, as the reader 
should have no difficulty in proving by following the hints in 
problem 19.11. The only example of this case which the author 
has come across is given in problem 26.10.—Note that if we are 
neglecting the electron spin completely, an orbital wave function 
always satisfies (19.13a) and the effect of time-reversal symmetry 
is exactly the same as in the even-number-of-electrons case. 

Consider now a system with an odd number of electrons. The 
difference is that we have to use 7% = —y now instead of 7% 
=y (19.13). Case (a). The functions ¥; and Ty; again transform 
in the same way (19.20), (19.21) under G. As in the proof of Kramers’ 
theorem (below 19.18)), we cannot have Ty; = ay;, which is the only 
possibility if they belong to the same irreducible vector space. 
The representation D therefore occurs twice and we have an extra 
doubling of the degeneracy because of time-reversal. Case (6). 
Since in the discussion of case (b) for an even number of electrons 
we did not use (19.13), the situation is exactly the same for even 
and odd numbers of electrons. Case (c). In case (c) for an odd 
number of electrons, it can be shown that we may always choose 
base vectors ¥,; such that 7; is a linear combination of the yy 
(Wigner 1932, p. 557; Johnston 1958). There is therefore no extra 
degeneracy. An example of this case is the representation D(/® 
of the rotation group (problem 19.8). In fact all the representations 
D of the rotation group come under this case when j is half an 
odd integer (problem 19.9). Also it can easily be shown (cf. 19.19)) 
that u,, w_ transform according to I, under the point-group 32 
(Table 14), which forms another example. 
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We summarize these results as follows. The three cases are: 


(a) the representation D can be transformed to real form; 
(b) D and D* are inequivalent; 
(c) D and D* are equivalent but cannot be made real. 
For an even number of electrons or neglecting spin, this implies 
respectively the following consequences: 
(a) there is no additional degeneracy; 
(b) D and D* occur together and there is an additional 
degeneracy between them; 
(c) there is an additional degeneracy. 
For an odd number of electrons the consequences of (a) and 
(c) above are reversed. 


There now remains the question of deciding which category a 
given irreducible representation comes into, particularly deciding 
between (a) and (c) since case (b) can be picked out by inspection 
of the character table. We state without proof the following test 
which depends only on the group characters: 


> x(R?) =h, case (a) 
m = 0, case (b) 
case (c) 


(19.22) 


Here the summation is over all the h group elements F of the 
group ©. Parts (a) and (b) of this relation can be proved easily 
by elementary methods using the hints in problem 14.16; case (c) 
is more difficult (Wigner 1932; Frobenius and Schur 1906; Johnston 
1958). For example, from Table 12 it follows that for the group 32, 
I, I’, I’; belong to case (a), I", and I’; to case (b), and I, to case (c). 

Returning to the example of a J = 3/2 level being split by a 
crystal field of symmetry 32, we have that the states transform 
under the group 32 according to I” (19.19). Since J is half an odd 
integer, we are dealing with an odd number of electrons. The 
representations I", and I’, come under case (b) and give a doubly 
degenerate level. I’, is two dimensional and comes under case (c); 
this gives no additional degeneracy due to time-reversal and we 
have another doublet. Similarly the states of a J = 5/2 level trans- 
form according to 


Mth t+h+T, 
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under the group 32. J, and I, together give a doublet. The two 
representations I, each give a doublet, and there is no reason to 
expect a degeneracy between them. 
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Summary 

We have discussed the transformation known as time-reversal. 
It is a symmetry transformation of the Hamiltonian for any electronic 
system in any electrostatic field provided there is no externally 
applied magnetic field. It leads to various additional degeneracies 
beyond those expected from rotational and other symmetries, 
particularly in the case of systems with a relatively low symmetry. 
In particular Kramers’ theorem states that for an odd number of 
electrons all states have an even degeneracy. 


PROBLEMS 


19.1 The Hamiltonian for a system of electrons is known to be 
invariant under space inversion JJ and under time reversal T. 
Consider the following interactions between two particles 1 and j 
where p = pi — py and r = rj — 1: (8¢°8))(P'r), T° PA (Si + 84), 
r-p A (8; — 8), r-D A (SA 8) F-(8¢ — 8), (F-8:)(F +8). Show 
that only the second and the last can occur in the Hamiltonian. 

19.2 Consider a single-electron time-independent wave function 
y. Show that the operator T can be written ioyC as regards its 
effect on ys, where (' takes the complex conjugate of the orbital 
part of y, and cy is the Pauli spin operator of problems 11.1, 11.2. 
Show that for ” electrons 7’ becomes (1)"ay,ay9 . . . GyaC. Use this 
form to establish (19.13). For one electron write the Hamiltonian 
in the form #) + #02 + #e0y + Hc, and hence prove (19.15) 
(Klein 1952). 

19.3 The function ¥,, belongs to the representation exp(iM4) 
of the axial rotation group. Show that 7'y,, belongs to exp(—tM¢). 

19.4 A classical system of particles is moving under velocity 
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independent forces. Show that the Hamiltonian #°(q, 9) = HN; 
—p), where the q;, »; are the generalized co-ordinates and momenta. 
If gi(t), pi(t) gives a particular motion of the system, show that 
gi(—t), —p(—t) gives another possible motion, which is simply 
the original motion reversed. Discuss physically and mathematically 
how this situation breaks down in the presence of an external 
magnetic field. 

19.5 Discuss the splitting of a free-atom level with quantum 
number J under the action of an electric field with point-group 
symmetry 32, when J = 0, 3, 1, 3,... 6. 

19.6 Discuss the splitting of the J = 5/2 ground state of a cerium 
ion in a field of symmetry 62m (§18). The double-valued irreducible 
representations of this group can be calculated in the manner of 
§ 16 or found in Bethe (1929) or Koster (1957). 

19.7 Show that {(7)*(74) dr = [¢*dr, where ¢* is used 
in the sense of (13.1) and (19.9). Z is any angular momentum 
operator. Prove its Hermitian property {y*Iédr = = tls dz, 
and that it is a pure imaginary operator TL) = —LTy. Hence 
show fy* Ly dr = —f{(T'p)*L(T'p) dr. Also prove that the expecta- 
tion value for the magnetic moment A(L + 2S) is zero for any 
non-degenerate level. 

19.8 Show from first principles that there are no linear combina- 
tions w,, w, of the vectors u,, u_ which transform with real coefii- 
cients under all rotations. Hint: suppose that w,, w, exist. Using 
problem D.2 show that Tw; = aw;, which, however, leads to a 
contradiction as shown in connection with (19.18). 

19.9 The functions 4m transform according to D under rota- 
tions, where j is half an odd integer. Using problem 8.16, show that 
the functions dm = %m + 1i-°™Tyb_m transform into one another 
under rotations and time-reversal. Show the same for the functions 
Xm = Uhm — 1-?=™T_m). Prove that this representation always 
comes under case (c) of the text. Hint: show that cases (a) and (b) 
do not apply by using the method of problem 19.8. 

19.10* Ifthe ground state of a molecule or molecular complex is 
degenerate, then the molecule spontaneously distorts itself so as to 
split the ground state and lower the degeneracy. This is the 
Jahn-Teller effect. Exceptions occur for linear molecules, or when the 
degeneracy is the two-fold Kramers’ degeneracy which cannot be split. 
Illustrate the origin of this effect, and discuss the importance in it 
of time-reversal symmetry, particularly when taking spin into 
account (Jahn and Teller 1937, Jahn 1938). 

19.11 Prove that in case (c) for an even number of electrons, 
time-reversal symmetry leads to a doubling of the degeneracy. 


176 GROUP THEORY IN QUANTUM MECHANICS 


Hints: show that 7'y; cannot be linearly dependent on the yy for 
all j, because if it were we would have functions ¢; = yy, + Tyy 
in the vector space (,, ... ¢n) satisfying T¢; = ¢; giving case 
(a). Thus at least one Ty; is not a linear combination of the yy, 
and by orthogonalizing and then using problem 14.15 we can 
construct from it a whole set of functions orthogonal to the 4 
but degenerate with them. 


20. Wigner and Racah Coefficients 


Coupling of angular momenta 

Suppose that Uy) and Vy’ are two sets of normalized 
functions transforming respectively according to D‘) and Dts”) 
under rotations. For the present we shall also assume that the two 
sets are functions of two quite different lots of variables, such as 
orbital variables and spin variables or the co-ordinates of two 
different particles. Then the products Un')Vi_'5" are also normal- 
ized. Using the method of § 9, we can in principle pick out normalized 
linear combinations 


Wag = > (ij’mm! TMU PV ''9” (20.1) 


mm 


which transform according to D“), where 


j-7] <I <jti. (20.2) 
The phase of the coefficients (jj’mm’'|JM) is fixed by making 
(37'7, J — j|JJ) = real and positive, (20.3) 


for each value of J. This defines the vectors (20.1) completely and 
hence also the coefficients (7j’mm’|JM), which are known as Wigner 
coefficients or alternatively as vector-coupling or Clebsch—Gordon 
coefficients. Now since in (20.1) the vectors Wu and Um) Viq?'S? 
are all normalized, (20.1) can be regarded as a unitary transforma- 
tion in the vector space {..., Um' Vm", . . .} so that the matrix 
A of Wigner coefficientst is unitary A-! = A* (problem A.10). 
Furthermore since (8.18) contains only real coefficients, the process 
described in § 9 for picking out the vectors Wy) (20.1) leads only 
to real numbers for the Wigner coefficients, so that 


A-1 = 4, (20.4) 
} In this matrix each row is labelled by @ pair of numbers m, m’, and each 


column by J, M. The matrix is clearly square of order equal to the dimension 
of the vector space (cf. problem 9.7). 
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Hence we can invert the equations (20.1) and write 


Um? Vm) = > (Gj'mm' |JM)Wy”. (20.5) 
J,M 


The situation becomes slightly more complicated when the 
U’s and V’s are not functions of two completely different sets of 
variables, for now the products Uim'))Vm’9" are in general not 
normalized. However the vectors 


> (jj'mm' IM) PV m5? (20.6) 
mm 


still have the same transformation properties (8.18) as in the previous 
case of (20.1). Hence (§ 8) all the vectors with the same J have the 
same amplitude, and we only need to include a factor N, in (20.6) 
to obtain normalized vectors 


Wy =Ny > (jj'mm' |JM)0 Vn’, (20.7) 
mm 


where N,, is independent of M. Equation (20.4) can again be used to 
invert (20.7). 


Um DV mI = > (ij’mm' |\JM)W y/N 5. (20.8) 
MJ 


This justifies the equations (9.7), (9.8) given in § 9. As already 
pointed out in § 9, we have 


(jj’mm'|JM) =0 unless M=m-+m', . (20.9) 
so that the summation in (20.8) reduces to a summation over J only. 


Spinor invariants 

We shall now carry out the first partt of the calculation of a general 
formula for the Wigner coefficients. This will also serve to introduce 
and illustrate the technique known as the method of spinor invariants 
(Brinkman 1956). Consider a sum 


® = cp; — (j summed), (20.10) 


to which we apply a unitary transformation T. With the nomencla- 
ture explained below equation (5.14), we suppose that the ¢, 
transform as base vectors — 


Ts = Tydi (20.1 1a) 
{ This essentially follows the discussion of Van der Waerden (1933). 
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and the c; as coefficients 
Te; = of T In = Ty*er, (20.11) 


where the last equality follows from the unitary property (problem 
C.2). Then the sum (20.10) remains invariant under 7’, and is called 
an invariant. In order to compound angular momenta in the manner 
of (20.7), it is convenient not to consider arbitrary sets of U’s and 
V’s, but to use the specific sets of vectors 


j+m,,j—-m 
Wy be 


oft yf 
= (Gj + mg — + mg a m)t 2 jG) + 


(GF +m i — mye 
(20.12) 


Here u,, u_ and v,, v_ are spinors transforming according to D@/® 
under rotations, and we recall from problem 8.11 that the vectors 
(20.12) do transform in the standard way (8.18) according to the 
representations D') and DG"), The method of spinor invariants 
derives its name from the use of invariants (20.10) in which the 
functions are of the form (20.12). 

Step 1. We first suppose that x,, x_ are two quantities that trans- 
form as coefficients according to D\/), Thus 


Um) 


Vin 


LU + BU (20.13) 
is an invariant. Step 2. Since (20.13) is invariant, so is the quantity 
(wu, + 2u_) = (QI) > XyUy™, (20.14) 

M 


where the Uy,‘Y) are defined in the manner of (20.12). The coeffi- 
cients Xy) therefore transform according to DY) as coefficients. 
By expanding the binomial in (20.14) we obtain explicitly 


a tM yy 


<u =F Ms — 


(20.15) 


Step 3. Under a rotation R the vectors v,, v_ transform according 
to (8.24), which may be written 
Rv, = av, + be_, 


Ro_ = —b*v, + a*v_, (20.16a) 


or R(v_) = a*(v_) + b*(—v,), 


R(—v,) = —b(v_) ++ a(—v,). (20.16b) 
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Comparison of (20.16) with (20.11) shows that v_, —v, transform 
as coefficients, and hence 


V_U, — V,U_ (20.17) 
is an invariant. 
Step 4. We now consider the expression 


= (vu, — v,u_)(e,u, + xu_)jo-"(x,0, + x_v_)ji> 
(20.18) 
where A=jt+j' —Jd. 


From (20.13), (20.17), Z is an invariant. Each factor can be expanded 
by the binomial theorem 


A 
(vu, —v,u_y = > (—1)r (") (v_u,)T(v,u_)’, 


r 
where the binomial coefficient (") is defined by (20.25) below. 


Collecting the powers of u,, v,, x4, together, we obtain 


i= 2, 2 [(9 + m) Gj — m)(9" + m')\(9’ — m' (J + MJ — M)iy? 


23 —A 25’ —) y 
>, (— ir(? ee a ae Mi te m' — ; UV a Ke is 
= AQF — AM 2’ — AEDS S aay Vin? Vie XD ants (20.19) 
m m’ 


wheret 


Om = > (Hf x 
"LG + mj — myiig’ +m’)! x 
x (Fj — mJ + MJ — My? 


* Fm — Ng +m —A+nIG" +m! — rx 
x (j — mm’ —A+r)l(r)(A — 7)! 
(20.20) 


Since the Xy,“) transform as coefficients according to DY), the 
quantities 


DS a Um? Vey? (20.21) 
mm 
with UM =m-+ Mm’ 


{ The summation is over all terms such that none of the brackets becomes 
negative. Also 0! = 1. 
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which multiply the X,, in (20.19) must transform as base vectors 
according to DW), This is the coupling of angular momenta which 
we require, and comparison with (20.7) gives the proportionality 


(jj’'mm’ |IM) = byyyOm (20.22) 


where bj;;’ is a constant. 


A general formula for Wigner coefficients 


A consideration of the result (20.22) illustrates both the power 
and the limitation of group theory. We have used transformational 
properties to relate the vectors w,,) with the same value of J 
to one another, and this gives the relationship (20.20), (20.22) 
between the appropriate Wigner coefficients. However we have not 
obtained the actual values of the Wigner coefficients, or what is the 
same thing, we cannot so far compare coefficients with different 
J’s in any systematic way. To do this we have to take account of 
some algebraic considerations (as distinct from the effects of 
rotational transformations above), namely (20.3) and the fact 
that the definition (20.1) of the Wigner coefficients requires vectors 
normalized to unity. Now there is no direct way of normalizing the 
Um™ and Vin'4” defined by (20.12), so that we have to use instead 
the algebraic relation (20.4). This gives 


AA =Eor 
> (ji’mm'|JM)? = 1, (20.23) 


where the summation over m, m’ is subject to m + m’ = M. 

Following E. R. Cohen (1949), we now calculate 6,74’ by sub- 
stituting (20.22) in (20.23) for one convenient value of M, namely 
M =J. Equation (20.22) then gives all the Wigner coefficients. 
For M=—J we have m’ —J—m, and the summation (20.20) 
reduces to the single term r = j — m. Hence from (20.22) 


(jj'm, J — m|JJ) 
__ bay 21) E +I = mj + = ua 
GETING FID LG TF yg ml] 
+ By interpreting the u,, vw as creation and annihilation operators for 
spin 4 particles, Schwinger has been able to utilize the symmetry properties 


of expressions like (20.12) and at the same time to work with normalized 
wave functions. This then gives the Wigner coefficients directly. 
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Substituting this into (20.23), we obtain 


b%r4y (2d) eed are) 
GFI-GNGFI-MEN jom Nj —T4+m) =" 


(20.24) 
We now use the formula 
(5) __ %(% — 1)(% — 2) -.. (2~B+1) 
B p! 
_ (ap Cae = DE 8 Gi A= 9) Ganon 


“#5 


The sum in (20.24) becomes 


> jJ+I—m\( j+m ) 
j—m j—-TJ+m 


™ 
: _y (iF -I-W 7-4-9 1 
SG, Nee) 
eee Aaa = (547 4443) 
jes“ G4a—a J 


Here the summation of the binomial coefficients is obtained by 
picking out the coefficient of 2/+5’—/ on each side of the identity 


(1 + 2)§-F-J-A] 4 a'-9-J-1 = (1 4 x) V2, 
Thus (20.24) together with (20.20) and (20.22) gives 


M 
m+m 


G+7 —J)\(j+7I—j)! x 1/2 
(jj'mm'\JM)=} XG AT —HIQT+D] 8 


G+7 ++i) 


(9 + m)'(j — mj" + mn’)! x 
ey ea 
| GH m= MIF + mG Fm’ — al x 

x (J —j-m 4+r)irlg +7’ —J—r)! 


(20.26) 
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Numerical values of some Wigner coefficients are given in appendix 
I, and algebraic tables for the cases 7’ = 3, 1, $ are given by Condon 
and Shortley (1951, p. 76). For references to further tables, see 
appendix I. 


Symmetry properties 

The Wigner coefficients have several symmetry properties obtained 
by permuting the quantum numbers. These symmetry properties 
could be obtained directly from (20.26) with some manipulation. 
However, it is more instructive to proceed as in the derivation of 
the general formula (20.26), i.e. to use first the transformational 
properties to establish a proportionality, and then to calculate the 
proportionality constant algebraically by considering a special 
case. We shall from now on use @, b, c instead of j, 7’, J, and denote 
the corresponding magnetic quantum numbers by «, , y. 

Let U,*, V,°, W,° be functions transforming according to D‘, 
D®), D), From (20.8) we obtain the matrix element 


(W,*)*U,2V 2 dr = (const)(abaB|cy). (20.27) 


This matrix element can also be calculated in another way by 
recalling (problem 8.16) that 


(—1)+¢-”(W,°)* (20.28) 


transforms under rotations in exactly the same way as W_,’. 
Hence analogously to (20.27) we have 


fU—y- 2900" [(—1) 7,1 9 dr 

= (another constant)(c, 6, —y, Bla, —«). 
Comparing this with (20.27) and putting y = « + 8, we obtain 

(abeeB|cy) = (—1)*+*Aave(cb —yB|a —a). (20.29) 
The two Wigner coefficients in (20.29) can easily be calculated from 
(20.26) for the special case « =a, B =c — a, y =c, and hence 


the constant Agy- evaluated. We obtain 


Qc + 1\¥2 
(aba ley) = (—1)>+ (4) (cb —yBla —a). (20.30) 
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All of the following basic symmetry properties can be proved in a 
similar way, and further symmetries derived by combining them. 


(aba |cy) = (ba —8 —a|c —y), 
= (—1)2+>-¢(baBa|cy), 
= (—1)+>-¢(ab —a —Ble —y), 


(20.31) 


Racah coefficients 

It happens in many calculations that sums of products of Wigner 
coefficients occur which are very difficult and tedious to sum by 
using the general formula (20.26) or a table of numerical values. 
By the use of the symmetry properties (20.31) and the unitary 
property (20.4), (20.23), it is usually possible to relate these sums to 
the Racah coefficients W(abcdef) which are defined by the relation 


(—1yer-t-« 
Wabedef)= > ET TDFA (20.32) 


a, B,y,5,¢ 


X (afad|cy)(cdy8 |ee)( fd |bB)(abzxB |ec). 


We shall show below that this expression (20.32) is independent of 
the quantum number « which is not summed over, and make one 
use of it. However, we shall not discuss the properties of the Racah 
coefficients in any detail. Since the coefficients relate different j 
quantum numbers, most of their properties cannot be derived purely 
from the effects of rotational transformations for much the same 
reasons as were discussed in connection with the Wigner coefficients. 
Suffice it to mention that the Racah coefficients have been used to 
simplify formulae in many branches of physics including the follow- 
ing: the angular dependence of scattering and reaction cross sections, 
the angular correlation of successive decays, nuclear structure, the 
theory of hyperfine structure, atomic and molecular spectra of 
complicated configurations. 


The calculation of matrix elements 
We shall now use Wigner and Racah coefficients to express the 
values of some general types of matrix element. Let T'm* be a set 


13 
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of operators transforming into one another in the standard way 
(8.18) under rotations according to the representation D‘), These 
are called irreducible tensor operators. For example the operators 


Ppail,, Sie = a, 


transform according to D“) in the standard way. The components of 
a second rank symmetric tensor with zero sum of diagonal elements 
transform according to D'” (see text of § 9 and problem 9.8). 

We shall first consider the matrix element 


jim | Tm |jgtma> = [YC mr) Tm YJ mq) Ar. (20.38) 
From (20.27) we have 
<fum | Pm |jame> = By,g,n(kjgmme|jym), 


where from the fundamental theorem of §13 the constant By,;,x 
does not depend on the m’s. This result can be put in a more sym- 
metrical form by using the symmetry property (20.30): 


<j, |T'm |jgme> = of 4.1\ua 
Bygyel—1yma( 3 + 7) (Jade > mym,|k pS m). 


It has become conventional to write the constant By,j,k 28 


wp, Sill Mia 
(2, FI) 


Here <j,||7"|j.> is another constant called the reduced matrix 
element which also is independent of m,, m., m. The double bars 
denote that it is not a proper matrix element but just a quantity 
associated with the matrix elements (20.33). Thus, we finally 
obtain 


Byjgk = (—1)jt 


<jim | Pm |jaM2> 


<All lljn> 


= (—])iatktma Ok Fi (j1jg — mym,|k — m), 


(20.34) 


where the Wigner coefficient gives the dependence of the matrix 
element on m,, ™., ™. 
We consider next a matrix element of the type 


Z = (L,8,5,M,| > (—1)"Xm Y_p | LSJ 2M). (20.35) 
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Here the wave functions 


o( LgSyJ Mo) ee 2, (L_8_M 72M ge|JyM »)}{ LeS2M 12M se) 
eas (20.36) 
‘transform according to DV») under rotations, and are linear com- 
binations of the §(L,S,M,.M 5.) corresponding to energy level 
eigenfunctions in the limit of Russell-Saunders coupling (§§ 11, 12). 
Similarly 
p*(L)S,7,M,) ae 2 (LyS, MM g1 (Fi )y*(L)S,M aM 5). 
mMs 
sii (20.37) 
In (20.35) Xm‘* is an irreducible tensor operator which operates 
on the orbital variables only, and Y_m‘* on the spin variables 
only. Thus we have 
(Ly 81M 11M g1|Xm™ Y_m™ | LyS2M 12M s2> 
= (L,MypSM5|Xm™|L2M128Ms5> x 
x (SM 1 LM ,| Y_m™ [SoM saLDM 1,» (20.38) 


where the matrix elements are independent of S, Ms and of L, Mrz 
respectively. The value of each matrix element is given by (20.34): 


(L,M SM 5 |X!" | LM 125M s> 


- XD 
(20.39) 


and a similar formula holds for the second element. Substituting 
(20.36) to (20.39) into equation (20.35), we obtain for the matrix 
element 


Z = CLA XM||L,><S PM ||S_> 


( —] ymtek+In+Sy +Mr2t+ Ms 
x 


m, Mr, Mrs, (2k aa 1) 
Ms, Mso 
X (LS, 5; |F,M)( £,8.M 2M g0\JoM 2) 
xX (LL, —M,,M,,|k —m) (S,8, —M 5M so|km). (20.40) 


To simplify this, we first note that as in (20.28), (—1)™Y_m‘* 
transforms in the same way as (—1)*Y‘"*, where we assume 
k to be an integer. Hence 


> (—1)®Xm™ Y_»'™ transforms like (—1)* > Ym')*Xm™, 
m m 
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and is therefore invariant under rotations (problems 13.12). Hence 
from the fundamental theorem of § 13 it follows that the matrix 
element (20.38) is zero unless J; = J, (=Jsay) and M,=UH, 
(= M say), and that it is independent of M. We now use the 
symmetry properties (20.31) to write 
(L,L, — M,,M,.|k — m) 

2k+1 


1/2 
= (—I)-h Marttatt (> T) (L,kM,, — m|L,M 7.) 


(8, Ss —M 1 Ms2|km) 


2k+1 
—: {—1)~Se—-Msa 
Carsemn ey 


1/2 
) "6S, — mit se|S If). 


We also use M,,. — M,, -++ m = 0 in the exponent of (—1). The 
summation in (20.40) then takes on exactly the form (20.32) required 
for the Racah coefficients and we have 


Z = CL,|XM|L,><8,|| P'S, >(—1) 4482-7 W (LS, LSaJ hk). 


The fact that the matrix element is independent of M shows that 
the summation in (20.32) is independent of ¢. In the above discussion 
the use of “orbital variables” and “‘spin variables” is, of course, 
unnecessary and any two independent sets of variables will do. 
We can therefore generalize our result and write 


Chijogm| > (—1)™Xm\™ Y_m™ |9'15'25’m'> 
m 


= 844 Sanme(— Lyf 452-F Cj XO Gs> x (20.41) 
x <fall YP ll7’o> W(rjeds Jo I*), 


where X‘*") operates on one set of variables corresponding to the 
quantum numbers j,, j’;, and Y_m‘” on a different set corresponding 


to Je» J's 
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radiation; Schwartz (1955) on the application to the theory of 
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Summary 


We have first discussed the definition of the Wigner coefficients, 
particularly as regards their phase, and the unitary property. 
Using only transformational properties under rotations, we derived 
the formula (20.20) which relates all Wigner coefficients with 
different m’s but the same j. This calculation illustrated the method 
of spinor invariants, and led the way to a complete formula for the 
Wigner coefficients, as well as to a study of their symmetry pro- 
perties. The Racah coefficients were then defined, and two common 


types of matrix element were expressed in terms of Wigner and 
Racah coefficients. 


PROBLEMS 


20.1 Two normalized sets of functions uw, = Slr) Yim(8,¢) and 
v, =f2(7r) Y,,(8,¢) both transform according to D“) under rotations, 
where Yj are the spherical harmonics (8.20). Show that the products 
Umv, are neither normalized nor orthogonal, and that the vectors 


> (1, 1, m, BJM )umy,, 
m, 


are orthogonal but not normalized. Calculate their normalization 
constants. i 


20.2 By operating on both sides of (20.1) with Z_ (§ 8), prove the 
recurrence relation 


A,(abeB|cy — 1) = A,,,(aba + 1B ley) + Ag,(abaB + 1 Icy) 


where Am? = j(j + 1) — m(m — 1). 

20.3 Calculate the Wigner coefficients (j, 4, m, +47 + 4, 
m + $) in each of the following three ways: (i) from the general 
formula (20.26); (ii) by carrying out the procedure of § 9 in detail as 
in problem 9.3 starting with the vector with J =j +13, M = j++; 
and (iii) by using the recurrence relation of problem 20.2 and the 
other recurrence relation given by Condon and Shortley (1951, 


p. 74) starting with (j4j4|j + 4,7 + 4) =1. Which method is the 
quickest? 
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20.4 By squaring (20.29), summing both sides of «, 8 and y, 
and using the unitary property (20.23), show that one obtains 
directly 

2c + 1\¥? 
Aave = +44) 


Show how the correct sign can be determined from (20.3) and 
(20.20) without using the general formula (20.26) for the Wigner 
coefficients. Note that this would be a more elegant way than the 
method used in the text for establishing the symmetry property 
(20.30), if it were not for the difficulty of establishing the correct 
sign. 

20.5 Prove all the symmetry properties (20.31) of the Wigner 
coefficients. Show that (ab00|c0)=0 if a+b+c—=an odd 
integer. 

20.6 Using the symmetry properties (20.31) and the fact that 
the sum (20.32) is independent of «, show that 


Wiabedep es 2 We MGs ae: 
X (aba |ec)(bdB5 | f4)(afad |ey)(eded|cy). 


This is the more usual definition of the Racah coefficients. 
20.7 By comparing problem 20.6 with equation (20.32), show 
that 
W (abcdef) = (—1)¢+f--¢ W (afedcb). 


Similarly prove that 
W (abcdef) = W(badcef) = W(cdabef) = W(acbdfe) 
— (—l)etf-a-4d W (ebcfad) = (—1)e+f->-¢W (aefdbe). 
20.8 Write down the value of the matrix element 
«LSI M|L- S| LSJM)> 


in terms of a Racah coefficient, and by comparison with (13.20), 
(13.22) calculate a formula for W(LSLSJ/1). 
20.9 Express the spin-spin coupling 


$4 °Sy/rej? — 3(Si° Piez)(Sj + Tey) /Tey® 


in the form 2(—1)"Xm\"Y_»” as in (20.41), where the X’s 
operate on the orbital variables only and the Y’s on the spin 
variables only. Also express 1/|r; — rj| in a series of terms of this 
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form, where in each term the X’s operate on r; only and the Y’s 
on r; only. 

20.10* Discuss the use of Racah coefficients in one of the follow- 
ing fields (see references in the text): (i) the angular dependence of 
scattering and reaction cross sections, (ii) the angular correlation of 
successive decays, (iii) nuclear structure, (iv) the theory of hyper- 
fine structure, (v) atomic spectra of complicated configurations. 


21. Hyperfine Structure 
Introduction 


In Chapter II, we found that the energy levels of an atom can be 
described in terms of configurations, which are split into terms by 
the electrostatic repulsion between the electrons. These terms are 
split again into levels by the spin-orbit coupling, where the levels 
are designated by a J quantum number. We now consider the 
previously omitted term #nuci of (10.1), which takes into account 

(i) the motion of the nucleus, its finite size, and the deviation 

of the potential from a pure Coulomb field near and inside the 
nucleus, and 

(ii) the spin, magnetic moment, quadrupole moment, etc. of 

the nucleus. 

The items in category (i) only introduce small quantitative shifts 
of the energy levels, known as isotope shifts because they are 
different for different isotopes (Ramsey 1953). We shall not discuss 
them further. On the other hand the items in category (ii) above 
lead to interesting qualitative splittings of the atomic energy levels. 
These splittings due to the interactions between the electrons and 
the nucleus are called the hyperfine structure. 

It might at first sight be thought that to treat the electron- 
nuclear interactions we would have to know in detail the motion 
of the nucleons inside the nucleus, which in turn depends on the 
nature of nuclear forces. This is to some extent true as regards 
the items in category (i) above, i.e. if one wants to calculate some 
of the isotope shifts. However, as regards the major features of 
the electron-nuclear interactions (category (ii) above), we shall 
show that the nucleus can be considered as a mass point with 
charge Ze, with an intrinsic angular momentum{ I of magnitude 
A{I(I + 1)]/2,a magnetic moment p», and with electric quadrupole and 
higher moments. Here J is the nuclear angular momentum quantum 
number, called the nuclear spin for short. We shall show that the 


+ In this section we shall have no occasion to use the infinitesimal rotation 
operators, so that the present use of I and 7 should cause no confusion. 
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magnetic moment p, is always parallel to the angular momentum 
I, so that we write 


(21.1) 


Here yy, is a measure of the magnetic moment, and is by convention 
loosely referred to as the magnetic moment. Strictly of course the 
magnitude of tn is 


VU + 1)) 


i rs as 


I 


from (21.1). 

We shall then show how the interactions between the nucleons 
and the electrons, and the energy splittings these give rise to, can 
be expressed in terms of the various nuclear moments. From the 
point of view of this section, we shall regard the values of these 
moments as arbitrary parameters to be determined by experiment. 
Clearly it is one of the aims of nuclear theory to calculate the 
moments from first principles, but at the present time this can only 
be done very approximately, and we shall not consider this aspect 
further. 


Nuclear spin and parity 
Let us first consider a system of A interacting nucleons not acted 
on by any external fields. The Hamiltonian is 


2 lve é U 
H = —-h 2 in Vi? +4 oot ("i — 14). (21.2) 


Here Uy is the interaction energy between nucleons 7 and j, which 
is not necessarily the same for all pairs of particles, and which may 
also depend on the nucleon spins and even their momenta. This 
Hamiltonian can be expressed in terms of the co-ordinates R(X, 
Y, Z) of the centre of mass, and the co-ordinates r’; of the particles 
relative to the centre of mass. 


A A 
MR = > mii, M=>m, r;=r;—R. 
1 1 
R and the r’; (i = 2, ..., A) are independent co-ordinates, and 
r’, is a dependent co-ordinate given by 


A 
mr’; = 2, mar". 


THEORY OF FREE ATOMS AND IONS 191 
Then (21.2) becomes 


I = Hom + A int, (21.3) 
i Re (at at at 
a NS a — parsers 
Hom = — a9 Vem sa axe + ot oz) 


< 1 f i? , ? 
Hoy = —WD 5 Wd + gp Gt + 4S Ule's — 89, 
3 


@ a 2a A 
Vi=\|a>57> ar) G =D V'- 
(az ‘ zz) a 


Here # om determines the motion of the centre of mass and Wnt 
the internal motion of the nucleons relative to the centre of mass. 
There are no “cross terms’ involving both R and the r’;, so that in 


the absence of external potentials the eigenfunctions of # can be 
written rigorously in the form 


W = tom(R)dint(t’s, - - » P'a)s (21.4) 


where ¢int Specifies the internal state of the nucleus. Now Hint is 
invariant under rotations of the r‘;, so that the eigenfunctions 
dint can be sorted into degenerate sets transforming according to 
D, This defines the nuclear spin I (nuclear angular momentum 
quantum number) already introduced. In addition the nuclear 
interactions Uy are known to be invariant under inversion to a 
very good approximation (§ 29), so that we can associate a definite 
parity w with the set of states belonging to each energy level (§ 15). 

We next suppose that the nucleons are moving in an external 
electric potential V(r), and add 


A A 
> V(r) = eZV(R) + [2 eV (r;) — eZ v(R)| (21.5) 


to the Hamiltonian, where eZ is the total nuclear charge. The first 
term gets included in # em and determines the motion of the centre 
of mass, whereas the bracketed term in (21.5) is a cross term that 
mixes different states of the form (21.4). However, it is extremely 
small, of the order of 10-? electron volts, compared with the excita- 
tion energy of the levels of #int which is of the order of 108 electron 
volts. Thus the amount of mixing of levels with different J and w 
is negligible, and for almost all practical purposes the wave function 
for the whole nucleus can be written as a single function of the 
form (21.4). Expressed in physical terms, the external field does not 
influence the internal motions of the nucleons appreciably, so that 
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we can regard the nucleus as a rigid thing with fixed angular momen- 
tum and fixed magnetic and electric moments. These properties are 
unaffected by the surroundings, in particular by the motion of the 
atom’s electrons, although the nucleus can of course rotate as a whole. 


The absence of an electric dipole moment. Higher moments 


Classically the electric dipole moment of the nucleus relative to 
the centre of mass is given by the operator 


A 
Detass = >. eqns, (21.6) 


where e; (=e or 0) is the charge of the 7*® nucleon. Let ¢ be any 
state in the vector space spanned by the ground state wave functions 
dint(Z, Mr). Then from the theorem (13.8c) we have that 


because ¢*¢ has even parity w* = +1, and Detass has odd parity, 
ie. changes sign under r; = —r;. The expectation value of the 
classical orbital dipole moment Deiass is thus always zero. However, 
there still remains the possibility that the nucleons have an intrinsic 
dipole moment, in the same way as electrons and nucleons have 
intrinsic spin angular momenta and spin magnetic moments. 
Suppose this is represented by an operator Dgpin. For us to interpret 
this quantity as an electric dipole moment, it must manifest itself 
through the occurrence in the Hamiltonian of an interaction term 


—é: (Detass + Dgpin) 


with an external electric field &. Now all known electromagnetic 
interactions are invariant under inversion, so that Dgpin must 
behave in the same way as Dejasg under inversion. Consequently 
the expectation value of Dgpin is also zero, and nuclei exhibit no 
permanent electric dipole moments.f 

In the same way it follows that nuclei can have no electric multi- 
pole moments of order 2' where | is odd, and no magnetic multipole 


¢ The same result applies of course to any system interacting with parity- 
conserving forces, In this strict sense the ground state of an asymmetrical 
molecule does not have a permanent dipole moment, and it corresponds to a 
spherically symmetric wave function with the molecule pointing in no particu- 
lar direction in space. However a molecule has a large number of rotational 
levels separated from the ground state by energies normally small compared 
with kT (§§ 23 and 24). This gives the system certain properties at ordinary 
temperatures which are analogous to those of a classical dipole moment. 
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moments of order 2! where lis even. The other moments are allowed, 
e.g. magnetic dipole, electric quadrupole, etc. (For definitions of 
higher order multipole moments, see Stratton (1941) and Schwartz 
(1955).) A nuclear 2-pole moment must therefore be an electric one 
if lis even and a magnetic one if lis odd. Hence the words “electric” 
and ‘“‘magnetic”’ are usually omitted without causing any confusion. 
It should be noted how the proof that the nuclei have no dipole 
moments depends on the fact that the internal wave functions ¢ 
have a definite parity --1, which in turn followed from the invariance 
under inversion of the nuclear interactions Uy. Real nuclear inter- 
actions can be divided into three classes; the strong interactions of 
the order of 10 MeV. which are responsible for nuclear binding and 
most other properties, the electromagnetic interactions of the order 
of 1 MeV., and the much weaker interactions of the order of 1 eV. 
which are responsible for beta decay (§ 29). Of these, the last is not 
invariant under inversion, so that nuclei are expected to have some 
electric dipole moment which, however, is too small to detect 
experimentally (Ramsey 1953, Lee and Yang 1956). 


Limitation of multipole moments by I 


Let M“ be an electric or magnetic multipole moment of order 
2'. This can be expressed as a linear combination of 21 + 1 irreducible 
multipole moments My," which transform according to the irreduc- 
ible representation D of the rotation group (Stratton 1941, p. 
182). The products Mm™¢ini(Z,Mr) transform according to 
>D where (§ 9) 

[l—i <j < I+. 


Hence by the fundamental theorem of § 13, the ground state matrix 
elements (I, M’r|Mm™|I, M7) are all zero unless 


\lJ—lj<I<I+l 
ie. unless 1 < 2/. 


Thus a single nucleon or any J = } nucleus can only have a magnetic 
(dipole) moment, and no quadrupole or higher moments. Similarly, 
an I = 0 nucleus has only a zero order moment, namely its charge Ze. 


The magnetic dipole interaction 


The operator of the nuclear magnetic moment due to orbital 
motion can be written down classically, 


A 
en,orb = >: (e4/2me)r’s A D's, 


1 


194 GROUP THEORY IN QUANTUM MECHANICS 


where -p’; is the momentum of particle 7 relative to the centre of 
mass. Clearly t2n,orp transforms under rotations and inversion in 
the same way as the orbital angular momentum operator Jr’; A p's, 
and this is still true if we include spin magnetic moment and spin 
angular momentum. (This latter fact was proved in detail in con- 
nection with the electron spin; see particularly below equation 
(11.18) and appendix F.) It follows from §13 that the matrix 
elements of tn and I are proportional within the vector space of 
the internal ground state wave functions ¢dint(Z, Mz), 


<I, Mr\palI, M'r) = (const)<I, Mz|I|I, M’r. 


The proportionality constant is conventionally written as jn/(hZ), 
and since we are only concerned with the ground state of the internal 
Hamiltonian # int, we can write effectively 


I 
Pn = Paz (21.1) 


This proves that , and I are always parallel. Similarly Heiect, 
the magnetic field at the nucleus due to the orbital motion and spin 
magnetic moments of the electrons, is proportional to their angular 
momentum J, provided we restrict ourselves to considering only 
one particular (say the lowest) electronic energy level: 


Heiect = (const) J. ~ (21.7) 


Now the magnetic interaction energy between the electrons and the 
nucleus is 
Hy = —Pn°* Heiect, (21.8) 


and after substituting (21.1) and (21.7) this becomes 


Hy =al-S (21.9) 


Heiect’ J 
—asetS). (21.10) 


We now turn to consider the kind of energy levels given by an 
interaction Hamiltonian of the form (21.9). We defer to the next 
subsection the problem of calculating Heject so that {en can be 
determined from (21.10) once a has been measured experimentally. 
We consider only the set of low lying states. 


WI, My, J, Ms) == dgnueill, My)erect(J, Ms) 


in which the nucleus and the electrons are both in their respective 
ground states ¢nuci(J, Mr) and Yetect(J, My). The interaction 
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H y (21.9) is invariant under combined rotations of the electronic 
and nuclear co-ordinates together, so that the eigenstates of the 
whole atom can be designated by combined angular momentum 
quantum numbers F, Mp. 
YI F Mr) = > (JM1Ms|FMr)¥U,M1,J,My), 
IMS 


T—JpPererey (21.11) 


The relative energies E(F’) of these (2 + 1)-fold degenerate levels are 
obtained by applying the perturbation # y (21.9) to the first order 
of approximation 


E(F) = (IJ FMplal- J|\IJFM p>. 
These matrix elements can be calculated by exactly the same 
procedure as was used in §13 to calculate the fine structure splitting 
due to the spin-orbit coupling (L-S. If F=1+ J is the total 
angular momentum of the atom, we have 
21-J = F* — PP — J? (21.12) 
and hence 


E(F) = 4oh*(F(F +1) — 1 +1) — JJ +1). | (21.18) 


Usually it is the difference between successive levels that is measured 
by nuclear resonance techniques. 


E(F) — E(F — 1) =ah?F. 


Thus the intervals should bear simple numerical ratios to one another, 
and any deviation from these simple ratios indicates some omission 
in the theory, e.g. the existence of a quadrupole moment and 
second order perturbation effects from #% y,. 

If the atom is in an externally applied magnetic field Hp, the 
Hamiltonian must contain also interaction terms —p.- Hy for both 
the electronic and the nuclear magnetic moments. For the same 
reason that p, and I are parallel, the electronic magnetic moment 
we is parallel to J. From problem 13.6, it is given by 


‘J 
Ye = —918 hk 
where gz is the Landé g-factor 


JJ +N +588 +1) —-LL+) 


gu =) t DIT +1) 
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and f the electronic Bohr magneton. The Hamiltonian then becomes 
H =al- I + (9rB/h)I «Ho — pn * Ho. (21.14) 


Here pn is of the order of the nuclear Bohr magneton ef/2mnac 
which is about 2000 times smaller than the electronic Bohr magneton 
because of the larger mass my of a nucleon. We shall therefore 
neglect the last term in (21.14). We first consider the case of a 
very weak external field (< 100 gauss) so that BHy < ah?. Toa 
first order of approximation the nuclear and electronic angular 
momenta are still coupled as described by the total quantum 
number F, and from (21.13) the energy levels are 


E(F, Mp) = E(F) + (g1h/h)XIJFM p|J -HolIIFMpy. (21.15) 


Within a manifold of states with the same F, the matrix elements 
of J must be proportional to those of F because they are both 
operators transforming according to D®), We can therefore write 
effectively [see (13.20)] 


J = kF, (21.16) 
where the constant k is evaluated from the identity 
I? = (F— J? =F* 4+ J?— 2F-J 
= F? 4 J? — 2kF*. 


If we choose the z-axis along the direction of Ho, the energy levels 
(21.15) finally become 


E(F, Mp) = yah F(F +1) — 1 +1) — JJ +1) + 
FF +) +53 +)—10+)) 


oe a ry i os | ab 


(21.17) 


Now if the field is rather larger (of the order of 104 gauss), we have 
that BH, > ah*, so that the electronic and nuclear angular momenta 
are decoupled and the states are designated to a first approxima- 
tion by the quantum numbers J, M,;, J, M;. Also BHp is less than 
the fine structure separation, so that electronic energy levels with 
different J do not get mixed and the approximation (21.9) is still 
valid. We have 


I-J =}(LJ_ + 1J,) + Is, 
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and only the last term gives any diagonal matrix elements in the 
M,, M, representation. The energy levels of (21.14) therefore are 


E(M,, M;) = ah?M ,;M, + 9, PHoM ;. (21.18) 


Fig. 12 shows schematically the variation of the energy levels with 
H, as they change from the low field region in which the approxima- 
tion (21.17) is valid, to the approximation of (21.18). 
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Fic. 12. Variation of the energy levels in a magnetic field for the 
case I = 3/2, J = 1/2,a > 0. The energy is plotted in units of ah* 
and the field in units of ah?/g,p. 


Calculation of the interaction constant a 


We shall now calculate the constant a in the magnetic interaction 
H yy (21.9). We regard the nucleus as a magnetized sphere with 
total magnetic moment tn, and we have to calculate separately 

(i) the interaction with the main part of the electron distribution 

which lies outside the nucleus, and 

(ii) the interaction with that part of the electron distribution 

which penetrates inside the nucleus. 
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We consider part (i) first. The orbital motion of the electrons 
produces a jaca field H,,, at the nucleus, given by 


vi A (—Pi) e pi A 5: A Ty 
Hom = ce ry? ~ me are 2p > 55 hr? 


The interaction energy of the nucleus with the electronic spins is 


—2B[si° Un  3(8i°Ts)(Un' Fs) | 
—Hypin'Pn = >, h rs = rye : | 
t 


Putting He1ect = Horn + Hspin, we obtain from (21.10) 


— 2pnB <5 >— — 8: , 3(Se'rs)rs 
an = TFT Ve A +8 >. 


(21.19) 


The matrix element must be evaluated for the particular electronic 
level considered. Closed shells make no contribution because the 
1, and s; add up to zero. For a single electron outside closed shells 
(21.19) reduces to 


ak? — Hol LL +1) =| >> (21.20) 


JJ +1) +1) 


as the reader should have no difficulty in showing by following the 
hints in problem 2.15. 

We now turn to the contribution (ii) above to the interaction 
constant a. Near the origin, a single electron wave function behaves 
like (const)r?, so that only an s-electron has any appreciable proba- 
bility of being inside the nucleus. Furthermore its orbital motion 
is spherically symmetrical so that we need only consider the inter- 
action of its spin magnetic moment —(28/h)s with the nucleus. 
The nucleus is idealized as a smal! sphere of volume V with a uniform 
magnetization . 

M = u,/V. 


The magnetic flux density B, inside the nucleus is then 


(21.21) 


It does not matter for the purposes of the present calculation 
whether the magnetic moment is thought of as being due to circulat- 
ing currents corresponding to the orbital motion of the nucleons, 
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or due to a permanent magnet corresponding to the intrinsic 
moments of the nucleons, or indeed as due to a mixture of both. 
The flux density By, is given by (21.21) in all cases. The potential 
energy of a magnetic moment py. introduced into this field ist 


—p," Ba. (21.22) 
The interaction energy is therefore 
Hy = —Bn‘ > (—2Bsi/h) (probability of electron being 
t inside the nucleus) 


= — (Foun) >, (20m WO/*7) 


t 


x” _ 16m pnB 1: Ds¢|¥(0)|? 
eS So i 


ie. (21.23) 


This can easily be expressed in the form 
H x= al- J 


using the trick of (21.10). A pair of electrons differing only in 
having opposite spins give no resultant contribution to (21.23). 
However, for a single s-electron outside closed shells, we have 
Ss; = J and obtain 


ah? — 16m HB (0) |. (21.24) 


3 «Od 


This is the most important case for practical purposes. 


Quadrupole interaction 


We start by writing down in full the Coulomb interaction between 
the protons in the nucleus and the electrons. Let rz, and rzz be 
the positions of the «® proton in the nucleus and the A'" electron. 


+ It is the magnetic field intensity H that is not the same for the different 
cases. We have H = (8r/3)M if the magnetic moment is produced by a 
current distribution in vacuum, and H = —(47/3)M if the sphere is a per- 
manent magnet. However, the magnitude of H is irrelevant to the calculation. 

¢ Formula (21.22) is completely equivalent to (21.9). Firstly the inter- 
action energy between two magnetic moments f4, and {4n can always be written 
in the two ways W = —p.: Bn = —Un~ Be depending on which one is 
thought of as being brought from infinity up to the other one. Secondly in 
(21.9) and throughout this book we should use B instead of H (Stratton 
1941, p. 130). However, since it only makes a difference in the present para- 
graph, convention has been followed and we have everywhere else written 
H although we mean the magnetic flux density B measured in gauss. 


14 


200 GROUP THEORY IN QUANTUM MECHANICS 


If we neglect the electron density actually inside the nucleus, we 
have r, always greater than r,, so that the Coulomb interaction 
energy can be expanded in terms of Legendre polynomials Py(cos w) 
(Stratton 1941, p. 172). 


H coulomb = >a 
pee ap I'ne — eal 


—e? 


= —e? > E + P,(cos w,,) + 


a, B 
+ as P2(008 wag) +. « | (21.25) 


where w,, is the angle between r,, and fg, 


Tnalep COS Wap = Lyghep + Ynailes + %naeps (21.26) 
P,(cos w) = cos w, P,(cos w) = $(3 cos? w — 1). 


In this expansion, the first term gives the potential —e?Z/r, of the 
nuclear charge, and this has already been included in the Hamil- 
tonian # orp of Chapter II. The next term can be put equal to zero, 
because it leads to expressions involving the components ¢27z,,, 
etc., of the nuclear electric dipole moment which we have shown to be 
zero. The third term in the expansion (21.25) is called the electric 
quadrupole interaction #9. Using (21.26), we may write it as 


Hq=-t 2 Qis(VE)ay, (21.27) 
Qu =e> [3 (nat? nas + Pnag? net) — Sy" na?) 
ae | 
(VE)y =e > gb lhe epitens + TepjTepi) — S4yten”), 
2 


where the 7; are the components z, y, z of r. Here the Qy are the 
quadrupole moment operators of the nuclear charge distribution, 
and the (V@); are the operator components of the electric field 
gradient at the nucleus due to the external electrons. 

The expression (21.27) can be written in a much more convenient 
form. The r,,,4 transform under rotations in the same way as the 
component J; of the angular momentum I, and hence the Qj; trans- 
form in the same way as 


RIL; + Ijls) — 8y12. (21.28) 


Furthermore, these expressions form a symmetric second rank 
tensor with zero sum of diagonal terms, so that they transform 
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according to the irreducible representation D) of the rotation 
group (problem 9.8). Thus the fundamental theorem (13.8) can be 
applied, and inside a set of states with the same J the matrix elements 
of Qi and (21.28) are proportional, so that we write effectively 

Quy = CLR LiL; + II) _ 817), (21.29) 
‘ where the constant OC does not depend on #, j. C is customarily 
expressed in terms of another constant @ called “the” quadrupole 
moment of the nucleus and defined by 


eQ = I, My = T|Q22\Z, M;=TI) 
= | pn(M = 1) [32n* — rn®] den 
= CL, My = 1/812 — 1|1, My = 1) 
= Oh? (31? — (I + 1)) = MOTI — 1), (21.30) 


where pn(M,; = 1) is the nuclear charge distribution for the state 
M, =I. Similarly ee can be written 


(VE) = 5 Bes + Spe) — ByS*] (21.31) 


hed ae 
in terms of the constant 


1 3z,2 — re? 
qa =—> | pels =J) | dre. (21.32) 


Therefore 
= eqs : 
#o = ser yjed ye 2 [Udy + Lele) — By") 


x [8(Sty + yds) — 5yI*). 


This can be simplified further. Since #gQ is invariant under simul- 
taneous rotation of nuclear and electron co-ordinates, it must be 
expressible in terms of the only available invariant combinations 
I- J and I2J?: 

Hg = a(I-J)? + BIJ) + yD?S?. 
The coefficients «, 8, y can be evaluated by putting thJ,=—JaJy 
—JyJz, etc., in the BI-J term, imagining everything multiplied out, 
and comparing coefficients of I,7Jz?, I,°Jy? and IzlyJ yr. 
obtain (Ramsey 1953) 


3e%q7Q 4(I-J)?  21-J or 


#0 = sTaF— Jar — as 7) 


(21.33) 
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The contribution of #@ to the energy levels can be evaluated 
easily in the limit of very small or zero magnetic field where F and 
My are good quantum numbers. We have from (21.12) 


CII FM p|21-I3|IJ FM p> 
=WK =F F +1)-—10+1)—JVvU 4+ 1), (21.34) 
and hence 
Eg= (FM p|# o| FM p> 


3e7q7Q : 
= S7al — ses EK +) — 8 + IT + 1}. 
(21.35) 


Alternative treatment of quadrupole interaction 


The following, more elegant, derivation of (21.35) illustrates the 
method developed by Schwartz (1955) for handling multipole 
interactions of any order using Racah coefficients. As shown below, 
4 g can be expressed in the form 


Hg = —4 > (—))™On(VE)_m, (21.36) 
where 


Qo = Qzz, O41 => V $(Qaz + yz); 
Qs = V4(Qzx = Qyyt2tQzy), 


and the (V@)m are defined similarly with (V&)) = (V&)z. By 
referring back to the definition (21.27) of the Qzz, etc., it is seen 
that the five quantities Q, or (V&)m are related to one another 
just like the spherical harmonics Y,,m (see below equation (8.21)). 
The Qm and the (V&)m therefore transform under rotations as 
standard base vectors according to the representation D‘2, The 
result (21.36) is now proved as follows. From (21.25), #g trans- 
forms irreducibly according to D®) under rotations of the nuclear 
co-ordinates alone, keeping reg fixed, say along the z-axis. Thus 
#q is a linear function of the Qm, and similarly of the (V&)m. 
Also #Q is invariant under rotations of nuclear and electronic 
co-ordinates together, and so is (21.36) (see the discussion below 
20.40)). The products Qm(V@)m’ transform according to D‘® 
x D™), and since this includes the representation D® only once, 
Hg must be proportional to (21.36). Finally the numerical con- 
stant can be written down by inspection from the coefficient of 
zn 22". 

In the absence of a magnetic field, the contribution of #9 (21.36) 
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to the energy levels can be evaluated using formula (20.41) for the 
value of the matrix element. 
EQ(F) = J FM p|—} > (—-)1)™On(VE)_m|IJ FM p> 
= —H 19 F TD Qmll >< (VE) mil > W (LIL F2). 
(21.37) 


The reduced matrix element <J||Qm||Z> can be expressed in terms 
of the quadrupole constant Q by using formula (20.34). From 
(21.30) we have 

eQ = <I, Mr = I{Qo|I, Mr = 1) 


area SZIQmliZ> 
/5 


= (=i (I, I, — I, 1|2, 0). 


Here the last bracket is a Wigner coefficient which is evaluated 
from formula (20.26). 


/5(21)! 
= Ey an | roar ree 
(2, 2, — 1, 12, 0) = (—1)" ap — ar + ape 
Therefore 
21 — 2)\(2I + 3)!}v2 
(IIQnilZ> = og WA AVE SYN ase) 
— (2L)! 
and similarly 
_ [(27 — 2)'(2F + 3)!}1/2 
(IVE) mlT> = egy 
Making use of the symmetry properties of the Racah coefficients 
(problem 20.7), we have 
W(LJIJ F2) = W(IIJJ2F), 


and in the latter form the coefficient can be looked up in the tables 
given by Biedenharn ef al. (1952). 
(2F — (27 — 2)!] 1/2 
(22 + 3)'"(2J + 3)! 

x 6[K(K +1) — $11 + 1)J(J + 1)], (21.40) 
where K is defined by (21.34). Now substituting (21.38), (21.39) 
and (21.40) into (21.37), we just reobtain the result (21.35) for Hg. 


(21.39) 


W(ILJI2F) = (—1)F-!-J 


References 


The subject of hyperfine interactions is covered in the books of 
Kopfermann (1940), Casimir (1936) and Ramsey (1953), where 
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further references may be found. Schwartz (1955) has developed 
the general theory in a form applicable to multipole moments of all 
orders. The theory of hyperfine structure of paramagnetic resonance 
spectra in crystals has been given by Abragam and Pryce (1951a), 
and the subject reviewed by acca and Stevens (1953) and 
Nierenberg (1958). 


Summary 

It has been shown that the nucleus may be considered as a rigid 
system with a fixed amount of angular momentum and various 
magnetic and electric multipole moments. The magnetic dipole 
moment is always parallel to the angular momentum. Not all 
orders of multipole moments exist, the number of non-zero ones 
being limited by parity considerations and by the nuclear spin J. 
In particular there is no electric dipole moment. The interactions 
of the magnetic dipole moment and the electric quadrupole moment 
with the atomic electrons have been discussed in detail. 


PROBLEMS 


21.1 Show from (21.5) that a strong external electric field can 
produce an induced electric dipole moment in the nucleus. This 
polarization of the nucleus occurs for instance when the atom’s 
electrons penetrate the nucleus, and it can lead to an appreciable 
shift of the energy levels in heavy nuclei, the magnitude of the 
shift depending on the particular isotope. Make an order of magni- 
tude estimate of this effect (Kopfermann 1940). 

21.2 Draw an energy level diagram analogous to Fig. 12 for the 
cases J = 1, ] =}, and J = 3, I =}. 

21.3 Calculate (JF Mp|—pn* Ho|IJ FU r> assuming Hy to be 
along the z-axis. 

21.4 Calculate the selection rules for magnetic dipole transitions 
between the levels of Fig. 12, (i) for low magnetic field and (ii) for 
high magnetic fields. Neglect those transitions which are induced by 
the coupling with the nuclear moment because their intensity is only 
about (1/2000)? times that of the electronically induced transitions. 

21.5 Deduce the value (21.20) of the coupling constant a for a 
single electron from equation (21.19). Hints: put J=1-+s; 
l-r=r A p-r=0; use the relations of problem 11.1 valid for 
a single spin only. 

21.6 With the notation of (21.27), (21.28), why is it not correct 
to write . 

C1, Mr|rirs I, M'1) = CCL, MrT; |, M'r> 


where C is independent of 1 and 7? © 
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21.7 Someone tried to calculate the constant k& of (21.16) by 
the following procedure. Why is it wrong? He put I=F— J 
== (1 — k)F, and then F? — I? — J? = 21- J = 2k(1 — k)F2, 


F(F +1) — 11 +1)—JVJ +) 
~ F(F $1) : 


which he solved as a quadratic equation for k. 
21.8 In deriving (21.35) from (21.34), one has to use the fact 
that 
(F, Mp|(I- J)?|F, Mp> = {(F, Mell - J|F, Mr>}?. 


Prove this. 

21.9 Calculate (JM1Mys|4#q|IJM;My> from either (21.33) 
or (21.36). 

21.10 Use the method of equations (21.36) to (21.40) to calculate 
the contributions of the magnetic dipole moment and the octupole 
moment to the energy levels in zero external magnetic field (Schwartz 
1955). 

21.11 A paramagnetic ion with a single 3d electron is in a crystal- 
line electric field, such that the wave function of the lowest (orbi- 
tally non-degenerate) level has the symmetry of x? — y?, Calculate 
the hyperfine interaction, and show that it can be expressed in the 
spin Hamiltonian form (§ 18) 7.4.8, where A is the tensor 


ie ee 


Calculate the value of 6 in terms of the expectation value <|1/r?|> 
for the 3d orbit. Note that the orbital momentum is completely 
quenched. 

21.12 The Mn++ ion has the configuration . . . (3s)?(3p)®(3d)5, 
the lowest term being &8. Is there any s-electron contribution 
(21.23) to the hyperfine structure (i) in the independent-electron- 
orbitals, Hartree-Fock approximation, and (ii) with a correct many- 
electron wave function? How does the situation differ in a term 
with D + 0? (Pratt, 1956; Abragam ef al., 1955; Heine, 1957). 


ive. 2k(1 — k) 


Chapter V 
THE STRUCTURE AND VIBRATIONS 
OF MOLECULES 


22. Valence Bond Orbitals and Molecular Orbitals 
Electron wave functions 


The purpose of this section is to discuss the electronic wave 
functions of molecules. We shall assume for the present that the 
nuclei of the atoms in a molecule are stationary, fixed to their 
equilibrium positions. The Hamiltonian for the electrons moving 
around the molecule is 


#=--E Sve Dro t >>, ea) 
ry ry t<j 


where the first term is the kinetic energy, — V(r) is the attractive 
potential of the stationary nuclei, and the last term is the mutual 
Coulomb repulsion between the electrons. As in § 12, the wave 
function WY has to be antisymmetric under the interchange of 
electron co-ordinates r;, and this is achieved automatically by 
writing ¥ as a determinant or as a sum of determinants. As in the 
case of an atom (§ 10), an exact wave function has to be represented 
by an infinite series of determinants because of “configurational 
interaction”, but in practice the first one or two determinants 
serve as a reasonable approximation for many purposes. The 
elements of these determinants are one-electron wave functions 
w(r)u, called orbitals. As we shall see, these are usually chosen to 
belong to one of three types, atomic orbitals, valence bond orbitals 
and molecular orbitals, depending on the circumstances. In each 
case one wants orbitals with definite transformation properties, 
so that one can use these properties in calculating matrix elements 
and energy values, etc. 


The hydrogen molecule 


In order to introduce the different types of orbital, we start by 
considering the hydrogen molecule. We take the two protons to 
be at the positions 


R, = (0, 0, —}a), R, = (0, 0, 3a). 
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The potential energy V(r) of an electron is shown in Fig. 13a, and 
includes the nuclear potential Vn(r) as well as the self-consistent 
average potential of the second electron. The potential is symmetrical 
about the origin and the electron can penetrate easily from the one 
hydrogen atom to the other one. The eigenfunction pp with lowest 
energy is the orbital shown qualitatively in Fig. 13b. We can have 


nee va b) 


Fig. 13. The hydrogen molecule: (a) the potential energy of one 

electron in the molecule; (b) the bonding orbital ¥,; (c) the anti- 

bonding orbital ¥,. In each case only the value along the line of 
centres (z-axis) is plotted. 


two electrons in this orbital, one with up spin and one with down 
spin, and the wave function for the two electrons in the H, molecule 
becomes 
Pr(0y)u,,  or(Fi ua 

; 22.2 
Po(e)tis Yn(P2)U_2 ( 


Since the orbital yp has a low energy, it gives the whole system of 
electrons plus nuclei a low energy, which means physically that the 
two atoms are bound together in a stable molecule. The orbital 
is therefore called a bonding orbital. The orbital %a with next lowest 
energy has the form of Fig. 13c. It is called an anti-bonding orbital 


Wyo = (21-12 
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because it has a rather higher energy than yp and hence would 
detract from the binding of a molecule. For example in H, only the 
bonding orbital is occupied, which leads to a well-bound molecule. 
On the other hand in the molecule He,, the bonding and anti-bonding 
orbitals are both occupied by two electrons each, so that the molecule 
is unstable. The origin of the energy difference between a bonding and 
an anti-bonding orbital can easily be seen from Fig. 13. The charge 
density {* is approximately the same for yf and yp, so that the 
potential energy [y* V(r) dv does not give the major contribution to 
the energy difference between them. However, the kinetic energy is 


f ¥AL—(4/2m)V2 do _ (H/2m) f |Ve|2do ng 
[ei—mamivtide _ wm) {alta ao 


From Fig. 13, Vy is clearly much smaller for yp than for ys, in the 
central region between the two nuclei, so that ¥p has a considerably 
lower kinetic energy than yp. 

Near the nucleus R,, the attraction of this nucleus is much 
stronger than that of the R, one and vice versa, so that yp and pa 
are approximately hydrogen 1s wave functions near the nuclei. 
This gives us a rough analytical representation of yp and ws (see the 
dotted lines in Fig. 13b and 13c); 


Ym = dae(r — Ri) + pus(r — Re) (22.4) 
$a = pas(t — Ry) — yas(r — Ro). (22.4b) 


Clearly the more overlap there is between the functions centred 
on the two atoms, the lower the kinetic energy (22.3), and the 
stronger the bond formed by the electrons in bonding orbitals, other 
things being equal. 


The covalent bond 


It has been found that the electrons in polyatomic molecules 
can also be described often by bond orbitals, such as pp and yp, of 
Fig. 13, which straddle two neighbouring atoms. If a/a = 0 
midway between the atoms (or nearly midway between them if 
they are different elements), then the orbital is a bonding one 
%p and if 4 —0 halfway or nearly halfway between the atoms 
then it is an antibonding one y,. Each type can be occupied by two 
electrons, one with up and one with down spin. As in (22.4), dp 
and y, can be written 


tp = ¥,(r — R,) + ¢,(r — R,) 
Ya = ¥4(r — R,) — ¥.(r — R,) 


(22.5) 
(22.5b) 
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in terms of some atomic-like orbitals %, and 4. These need not be 
ordinary 1s, 2s, 2p, etc., atomic orbitals which are symmetrical about 
the nucleus R, or R,. As already mentioned, the more overlap 
there is between ¥,(r — R,) and %,(r — R,), the stronger the bond 
that is formed. The strength of the bond therefore tends to be 
increased by having lopsided orbitals, stretching out towards one 
another from their respective nuclei R, and R, as shown for the 
carbon-carbon bond in Fig. 14. These lopsided atomic orbitals are 
called directed valence orbitals. 


Fie. 14. Carbon-carbon bonding orbital. The bonding orbital is the 
sum of the two directed valence orbitals shown dotted. 


A single directed valence orbital for a carbon atom is shown in 
Fig. 15a. Near the nucleus, the wave function must approximate 
to a 2s or 2p orbital, since the 1s ones are already filled and higher 
ones like 3s, 3p, . . . have too high an energy. Fig. 15 shows how 


% 


(a) 


(b) 


(c) 


Fie. 15. A directed valence orbital for carbon (a) can be seen to be 
expressible as the sum of the 2p and 2s atomic orbitals (b) and (¢). 
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the directed valence orbital can actually be formed as a linear 
combination of a 2s and a 2p atomic orbital. This is very satis- 
factory as regards one directed orbital by itself. However, as is 
well known, the carbon atom tends to form four valence bonds 
with its neighbours, directed as if towards the corners of a regular 
tetrahedron (i.e. alternate corners of a cube) as indicated by A, B, 
C, D in Fig. 16. The main problem is to show now that four orbitals 


3 4 


3 

\ 

K| 
Veh ay On fl 


— eae} 


co 


Fie. 16. Symmetry of the directed valence orbitals of carbon. A, B, 
C, D are the directions of the four orbitals, shown with reference to 
a cube. The four orbitals transform into one another under the point- 
group 43m, a few representative symmetry elements being shown. 


pa, vp, Yc, wp, directed as described and all equivalent to one 
another, can indeed be formed from atomic 2s, 2p, 2py, 2pz functions. 
This can be proved as follows. The directed orbitals are transformed 
into one another by the point-group 43m, and form a representation 
D of this group. The 3-fold axis shown in Fig. 16 induces the 
transformation 


A->A, B>C, C~D, D—B, 


the transformation matrix being 


] . 
r a 
1. 
1 
We have therefore for the three-fold rotation the character 
x(3) = 1. 


Since the character is the sum of diagonal matrix elements, we only 
get a contribution to it when an orbital is transformed into itself, 
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such as A above. Thus we can write down by inspection y(4) = 0, 
x(m) = 2, x(2z) = 0, x(E) = 4. From the character table of the 
group 43m (Table 16) and from the decomposition law (14.2), we 
can reduce the representation D into 


D=A,+T,. (22.6) 


Also 4(2s) is spherically symmetrical and transforms according to 
A,. ¥(2pz), p(2py), ¥(2pz) transform like x, y, z, according to the 
representation 7’. This can be seen by inspection, or from (14.4), 
or it can be looked up in appendix K or Table 16. Thus (2s), 


TABLE 16 
Character Table of Point-group 43m 


Representation Characters 
E 4, 2, 3 m 

As 1 l 1 1 1 rt 

A, 1 —l 1 1 —1 

E 2 0 2 —1 0 

T, 3 1 -1 0 —1 

T. 3 —1 —l 0 1 H, Y, z 
(see (22.6)) 


2(2pz), (2py), 4(2pz) also transform according to D (22.6), whence 
it follows that we can form linear combinations of (2s), (2pz), 
W(2py), ¥(2pz) which have the required symmetry properties of 
va, ¥B, wc, vp. The correct linear combinations, which can be 
found by using the projection operators (14.11) or by inspection, 
are (not in normalized form) 


ba = Y(28) — Y(2pz) — P(2py) + H(2pe2) 
vp = $(28) + (2px) + p(2py) + (2pz) (22.7) 
to = W(2s) + o(2pz) — Y(2py) — ¥(2pz) 
op = ¥(28) — 2(2pz) + o(2py) — (2pz). 


Each of these directed orbitals forms a bonding orbital of the type 
Fig. 14 with the neighbouring atom in the appropriate direction. 
Combined with the two spin functions wu, and u_, they make eight 
spin-orbitals. These are all occupied, four of the electrons coming 
from the central carbon atom and one each from the neighbours. 
Thus near the carbon atom the wave function is like a full shell 
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(2s)#(2p)° atomic wave function. However, the spin orbitals have 
to be normalized to unity, and since they straddle two atoms, each 
electron has only a 50% probability of being on the central carbon 
atom. The total charge on the carbon atom is therefore correctly 
given as 8 x 4 = 4 electrons. 

We can now see why the directed valence bonds of carbon must 
point in the tetrahedral directions of Fig. 16. From the fact that 
only one type of methyl chloride (CH,Cl) molecule is known to 
exist, we infer that all the four valencies of carbon are completely 
equivalent to one another. They must therefore make equal angles 
with one another, which limits their arrangement to two possibilities, 
namely the tetrahedral one of Fig. 16 and the coplanar one Fig. 17a. 
We can now repeat the above analysis for the planar configuration 
Fig. 17a, and soon find that it is impossible to make such a set of 
directed valence orbitals out of the atomic orbitals ¥(2s), Y(2pz), 
(2py), ¥(2pz). The directed valencies of carbon must therefore 
point in the tetrahedral directions. 


: (a) (bo) 


Fic. 17. (a) Symmetry of four directed valence orbitals in a plane; 

(b) three directions at 120° in a plane; (c) six directions at 90° to one 

another, These are called octahedral directions because they point 
to the vertices of a regular octahedron as shown. 


From the above example, the properties of the type of binding 
between atoms which we have been discussing can be summarized 
as follows: 

(i) The electrons are in localized bonding orbitals straddling two 
atoms, so that each electron is “shared” between the two atoms. 

(ii) There are two electrons with opposite spin in each bonding 
orbital, and by means of the sharing each atom tends to build 
up a full shell of electrons. Usually the atoms contribute 
one electron each to the bond, but one atom may contribute 
both. 

(iii) The bonds are directed along certain directions in space. 
Such a bond is known as a covalent one. Table 17 shows what 
directed valence orbitals can be formed from various sets of atomic 
orbitals. Each of these orbitals is axially symmetric about the line 
joining the atoms and gives what is called a o bond, which is the 
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TABLE 17 
Directed Valence Orbitals 
Co-ordination Configuration Directions 
number used. of valence orbitals 
2 ep linear 
3 ep? trigonal, plane 
ds? trigonal, plane 
p trigonal pyramid 
4 ep? tetrahedral 
dsp* tetragonal, plane 
5 dts tetragonal pyramid 
6 d®ep* octahedral 
d‘sp trigonal prism 
8 sp*d*f cube vertices 


usual type of saturated covalent bond. Unsaturated bonds can be 
described by an extension of the theory (see below). 

When an atom has two incompletely filled electronic shells, there 
clearly tends to be a considerable degree of choice in which atomic 
orbitals to use in making up directed valence orbitals. Not only 
can we vary which orbitals to use out of each shell but also how 
many electrons are to be allocated to each shell. For instance in the 
transition elements Ti to Ni, the 3d, 4s and 4p orbitals have com- 
parable energy and are only partly occupied, so that they are all 
available for bond formation. This explains why these metals form 
such a variety of covalently bonded complexes, such as the ferri- 
cyanides, ferrocyanides, and the [Co(NH,)s]*++ complex (see 
problem 22.4 and Table 17). 


Molecular orbitals and the structure of benzene 


If an electron can penetrate from one atom to the neighbouring 
one and be described by a bond orbital as in Figg. 13 and 14, then 
there would appear to be no reason why it could not go on to the 
next atom and so on. This can indeed happen, and electrons are 
therefore often described by molecular orbitals which extend through- 
out the molecule.t As an extreme example, a macroscopic crystal 
of a solid can be considered as one gigantic molecule, and Fig, 18 
shows an orbital in metallic sodium. It is an infinite chain made up 
out of 33 atomic orbitals. 

By way of a more detailed example, we shall consider the structure 
of benzene. The six carbon atoms are known from X-ray structure 


t In the case of a diatomic molecule, molecular orbitals and valence bond 
orbitals are clearly the same thing. 
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work to form a regular hexagon in a plane, which we shall take as 
the x, y-plane. The C—C and C—H bonds make 120° angles with one 
another, and this suggests (Table 17) that we can describe them 


Fig. 18. Lowest energy one-electron orbital in sodium metal. 


by bonding orbitals formed from (2s), ¥(2pz), %(2py) atomic 
orbitals on the carbon atoms and (1s) on the hydrogen. This gives 
the bonds shown as lines in Fig. 19, and leaves six electrons to be 


H 


| 
eee 
| 


oy 


H 


Fig. 19. Bond structure of benzene. The bonds corresponding to 
(22.9) are not indicated. 


fitted into orbitals made up out of the y(2p,) functions on the six 
carbon atoms. Let us write yz as short for 


2 = ¥(2pz) = af (r), (22.8) 


and let Ry, (n =1 to 6) be the positions of the six carbon nuclei 
in Fig. 19. The six orbitals have symmetry 6mm but it is simplest 
to consider just the sixfold rotation (point-group 6). A symmetri- 
cally linked orbital forming a chain round the ring similar to Fig. 18 is 


to = > dele — Rn). (22.9a) 


This transforms according to the identity representation A of the 
point-group 6 (see appendix K). Analogously the five orbitals 


dar =D exp(tin aieelr — Rn), (22.9b) 


C4 
HO ~*~ 


tie = 2, exp(-£2in abel — R,), (22.9c) 


os = > (—1)"be(r — Rn), 
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transform respectively according to the representations H,, E, and 
B. These orbitals are not o ones like those we discussed previously 
because the yz orbitals are directed up and down perpendicular to 
the plane of the hexagon as shown by equation (22.8). They form 
what are called 7 orbitals (see below). Nevertheless yo is still a 
bonding orbital because of the way neighbouring atomic orbitals 
join up to reduce the kinetic energy. Likewise 4, has a nodal plane 
half-way between each pair of carbon atoms, and is an antibonding 
orbital. The other four ¥1,, $4. take up intermediate positions. 
In the case of benzene we take only the three orbitals yo, %.,, py 
with lowest energy, putting two electrons in each (up and down spin). 
The ls inner core electrons on the carbon atoms are held tightly 
to their own parent atoms, and are therefore described by atomic 
orbitals. The total wave function is then a 42 x 42 determinant of 
12 atomic spin-orbitals (carbon 1s electrons), 24 valence bond spin- 
orbitals (C—C and C—H o-type covalent bonds), 6 molecular spin- 
orbitals (C—C z-type covalent bonds). 

There is a point of great chemical importance attached to the 
use of the molecular orbitals (22.9). If we apply a perturbation at 
one carbon atom by replacing the hydrogen atom by some radical 
such as —NO, or —Cl or -OH, then the sixfold symmetry is destroyed 
and the molecular orbitals become linear combinations of the simple 
ones Yo, 11, 49, %s- Since the energy difference between 41, and 
#1 is not very great, there will be a considerable amount of the 
latter mixed into the occupied orbitals. Thus since the orbitals 
extend round the whole ring, the charge density on every atom is 
affected, whereas in a valence bond picture only the two nearest 
neighbours would be affected to a first approximation. For instance 
in the trimethylphenylammonium ion [N(CH,)3(CeHs)]}* analogous 
to (NH,)*, the positive charge resides mainly on the nitrogen atom 
and thus attracts electrons from the benzene ring towards the 
nearest carbon atom, say number 6 in Fig. 19, resulting in a general 
deactivation of the whole molecule. Moreover it can easily be shown 
(problem 22.6) that the removal of electrons takes place preferen- 
tially from atoms 1, 3 and 5. These atoms are therefore completely 
deactivated, so that any reaction of the molecule takes place at the 
meta position (atoms 2 and 4). On the other hand most radicals 
(—Cl, —Br, —I, —OH, —NH,) tend to lose electrons to the benzene 
ring for reasons which we shall not go into here (Pauling 1939, 
p. 142; Coulson and Longuet-Higgins 1947, p. 56). If, however, 
we accept this as a fact, then the above argument shows that the 
ortho and para positions (atoms 1, 3 and 5) will be preferentially 
activated. 


15 
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In solids a molecular orbital such as shown in Fig. 18 which 
traverses the whole of a macroscopic crystal, is called a Bloch 
orbital. In metals the fact that the electrons have such extended 
wave functions is intimately connected with their ability to move 
easily through the crystal and carry an electric current. 


Energy considerations 


So far we have put electrons into atomic orbitals, valence bond 
orbitals or molecular orbitals in a rather ad hoc fashion, and by 
using different orbitals, we can make up different types of wave 
function for the molecule. For example, we could write down an 
atomic orbital type of trial wave function for the hydrogen molecule 


Pao = (2!)7¥? [dre(t1 — Ria)tya yYus(ti — Re)ui 
Pis(Pe — Rijs, ypas(T2 — Re)w_el ” 


This contrasts with (22.2) in that here each electron is permanently 
on one atom. The deciding factor in the matter is, of course, how 
can the electrons arrange themselves into orbitals so as to achieve 
the lowest energy for the molecule as a whole. In general the wave 
function giving the lowest energy in this way is closest to the true 
wave function. It is therefore one of the tasks of theoretical chemists 
and of solid state physicists to find out what type of wave function 
is the appropriate one for any given situation, and why. We shall 
therefore consider in more detail the effect on the total energy 
of using different types of orbital. Our remarks will apply to all 
the types of orbital, Bloch, molecular, valence bond, and atomic, 
considered as a sequence ranging from the very extended Bloch 
orbitals to the highly localized atomic ones. 

The first principle is that the more extended an orbital, the lower 
is the kinetic energy (22.3), provided it is a bonding type orbital. We 
have already seen that this is so for a valence bond orbital com- 
pared with atomic orbitals (Figs. 13 and 14), and from the sodium 
wave function of Fig. 18 it is clear that we tend to achieve a con- 
tinually lower energy in this way as we string more and more 
atomic orbitals together. The kinetic energy therefore favours 
extended orbitals over localized ones. 

The second principle is that the Coulomb repulsive energy in (22.1) 
has exactly the opposite tendency: tt favours localized orbitals. An 
orbital always has to be normalized to one electron, so that the 
more extended an orbital the less the probability of the electron 
being on a given atom. The atom therefore has to be covered by 
more electron orbitals to achieve the correct charge on it. Although 


(22.10) 
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only a few of these electrons will be found on the atom on the 
average, there is still a non-zero chance of them all being there 
together. In this case the electrons are very close together, giving 
a high Coulomb energy Ze?/rj. For example, we have seen in a 
saturated organic molecule that the use of bond orbitals (22.5a) 
and (22.7) means that a given carbon atom has in a sense acquired 
a complete shell of electrons (2s)?(2p)*. On the average only half 
of these will be on the carbon atom at one time, but there is a 
probability of (4)® of them all being there together. Although this 
probability is small, it corresponds to an enormous Coulomb energy 
and may therefore be expected to give an appreciable contribution 
to the total energy. Similarly, we can consider the effect of 5, 6 and 
7 electrons being simultaneously on the one atom. 

The third principle is that small interatomic distances (relative to 
the size of the atomic-like orbitals) favour extended orbitals and large 
distances the localized ones. With extended orbitals, the coming 
together of many electrons on one atom gives a high Coulomb 
repulsive energy which does not vary greatly with interatomic 
distance since it is concerned with the state of affairs inside one 
atom. Qn the other hand, the lowering of the kinetic energy of an 
extended orbital depends on the amount of overlap between atomic- 
like orbitals on neighbouring atoms, and this is greatest for small 
interatomic separations. Thus the kinetic energy tends to win 
over the potential energy at small separations and vice versa at 
large ones. This is in accordance with common sense, as we shall 
see by considering the example of the hydrogen molecule. Suppose 
that we can alter the internuclear distance R = |R, — R,] arbi- 
trarily at will. For very large R, we have two separate atoms, each 
with one electron. We do not expect to find both electrons ever on 
one atom, and the localized atomic orbitals are the right ones to use, 
giving the atomic orbital type of total wave function Wao (22.10). 
At the opposite extreme for very small R, the two protons may be 
considered almost like a doubly charged helium nucleus, and the 
electrons revolve in molecular orbitals round the pair as a whole 
without being identified with the one proton or the other. This 
gives the molecular orbital wave function yo (22.2), in accordance 
with the third principle. At the equilibrium internuclear distance 
the molecular orbital function is still the better, corresponding to a 
covalent chemical bond. 

This example of the hydrogen molecule illustrates another point. 
The preceding discussion suggests that at an intermediate inter- 
nuclear distance, neither %yo nor Yao is a satisfactory wave 
function and that we should take a linear combination between the 
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two. Only in this way can we go over continuously from the small 
R to the large R limit (see problem 22.7). However, even this would 
only give an approximate representation of the correct wave function. 
The true wave function of a molecule or atom cannot be written as a 
single determinant or as a simple sum of two or three; it requires an 
infinite series of determinants. The use of a single determinant, 
as we are mostly doing in this section, is therefore just an attempt 
to obtain a simple semi-quantitative description of molecules. 
Only in this context is the discussion of when to use which type of 
orbital meaningful. Also the use of valence bond or molecular 
orbitals with special symmetry properties is just a part of this 
approximation. Nevertheless, as in the case of atoms (§§ 10-12), 
the total Hamiltonian of the molecule (22.1) has a definite symmetry 
group. Thus the wave function Y of the whole molecule can be 
assigned rigorously to an irreducible representation of this group, 
no matter how complicated "% may be when expressed in terms 
of one-electron orbitals. Any symmetry arguments about VY are 
therefore precise. 


Diatomic molecules 


We shall now discuss diatomic molecules in more detail, con- 
tinuing for the present to use the approximation of writing Y as a 
single determinant of one-electron orbitals. We shall further assume 
that with the atoms at their ordinary equilibrium distance apart, 
the molecular orbital description like (22.2) is a better approxima- 
tion to the true wave function than the Heitler-London atomic 
orbital one like (22.10). In the case of diatomic molecules, valence 
bond orbitals and molecular orbitals are of course the same thing. 
If the two atoms are different, e.g. CO, the symmetry group of the 
molecule is 00m. We choose each electron orbital to transform 
according to one of the irreducible representations of this group 
(Table 18). The notation A,, A,, Hj, ...is analogous to that for the 
finite groups, the alternative one 2+, 2-, IT, . . . being the usual 
chemical one. Capital letters, 2, 17, 4,... are used when the repre- 
sentations refer to states of the whole molecule, and small letters 
are often used for the same representations when these refer to one- 
electron orbitals. This is analogous to the use of S, P, D,... and 
8, p, da, ... states in atomic theory. The bonding and anti-bonding 
orbitals of Figs. 13 and 14 are seen to belong to the representation 
ot, It is impossible to devise a one-electron orbital of the o— type, 
because if it is cylindrically symmetrical under the rotations then it 
must be invariant under the reflection mz. The symmetry 2- can, 
however, apply to a whole molecular wave function (problem 22.11). 
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TABLE 18 
Character Table for the Group oo m 


Trred. Characters 
rep. E R(¢, z) Mg 
A, or 2+ 1 1 1 
A, or J- 1 1 —1 
E, or iT 2 2 cos 0 
E,or 4 2 2 cos 2¢ 0 
HE, or A 2 2 cos Ip 0 


The irreducible representations of the group co/mm are the same with 
an additional suffix g or u according to whether they are even (gerade) or 
odd (ungerade) under inversion. 


Orbitals of the other symmetry types can be constructed as 
follows. If the axis of the molecule is chosen as the z-axis, then in 
the notation of (22.8), 


(mz, bond) = #2(r — R,) + ¥’2(r — R,) (22.11) 


%(ay, bond) = py(r — R,) + #’y(r — R,) 


form a pair of bonding 7 orbitals. If the two atoms are unlike, 
y and yb’ may be different p functions, e.g. 2p and 3p ones. Together 
with the o orbital, these orbitals can be used to describe double and 
treble bonds as in acetylene H—C = C—H. The antibonding orbital 
can be formed similarly. When forming 6 and higher orbitals, it is 
helpful to note that a pair of functions transforming according to the 
general representation A (or H;) can always be chosen so as to belong 
to the representations exp(7l¢) of the axial rotation group. For 
example instead of the pair (22.11), we could use the linear com- 
binations 


Y(z,, bond) = (zz, bond) + i#(7,, bond), 


Wn, bond) = (wz, bond) — ip(ny, bond), 1”) 


which have this property. 

If the molecule consists of two like atoms, the symmetry group 
is oo/mm which contains the inversion as an extra symmetry 
element. The irreducible representations are the same as those in 
Table 18, except that each one has an additional suffix g (gerade) 
or u (ungerade) according to whether it is even (g) or odd (u) under 
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the inversion. (See the preamble to appendix K, and appendix L 
for further details.) 


The oxygen molecule 


Table 19 gives the different possible orbitals that can be formed 
out of the 2s and 2p functions on the two oxygen atoms, and which 
of these are occupied. They have been arranged in what is the 


TaBLE 19 
Orbitals in the Lowest Energy Configuration of O, 


Number 
Number of Irreducible Atomic Bonding 
Orbital of independent representa- functions or anti- 
orbitals spin tion bonding 
occupied orbitals 
og28 2 2 ogt 28 bond. 
og2p 2 2 ogt 2p z bond. 
Ty2p 4 4 Tu 2px, 2py bond. 
oy2s 2 2 Out 2s anti-b. 
mg2p 2 4 ™; 2pz, 2py anti-b. 
oy2p 0 2 Out 22 anti-b: 


A 


probable order of increasing energy. The 2s atomic function has a 
lower energy than the 2p one, so that the og2s bonding orbital has 
the lowest energy. Of the 2p orbitals, the one made out of two 
2pz functions (cg2p) will have the maximum amount of overlap 
and the lowest energy. Likewise the corresponding antibonding 
orbital oy2p has the highest energy. The 2p; and 2p, atomic 
functions have less overlap, so that the orbitals wy2p and ag2p 
made up from them have an intermediate energy. Note that for 
o orbitals the even one (g) is the bonding one, whereas for 7 orbitals 
the bonding one is odd (u). The precise order of the first four rows 
of Table 19 is actually somewhat uncertain, particularly as regards 
the position of the 2s antibonding orbital. This uncertainty is, 
however, quite immaterial, because if we put the twelve available 
valence electrons into the orbitals in order, only the last two types 
of orbital are incompletely filled and there is no doubt about their 
order. Thus the configuration of the molecule is 


(ogls)2(oyls)2(og28)*(og2p)2(ru2p)*(mru28)*(mg2p)2. ... (22.13) 


In this configuration, the zg2p shell is incompletely filled, only 
two of the available four spin-orbitals being taken up. Thus we can 
write down #0, = 6 different wave functions all belonging to this 
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configuration. As in the case of atoms, these wave functions group 
themselves into terms. All the terms can easily be found by extend- 
ing Slater’s procedure for atoms, and this is left as problem 22.13. 
As in the atomic case (§ 12), each full shell like (cg2s)? or (xy2p)4 
is completely symmetrical and need not be considered in deriving 
the symmetry of the total wave function. Here we shall just write 
down the ground term by inspection, using arguments analogous 
to. those relating to Hund’s rule in §12. The four 7,2p spin-orbitals, 
of which we have to choose two, are 


p(mz)u,, ab(ar~)u_, b(wy)u,, b(ary)u_ 


in the notation of (22.11), all being anti-bonding. The kinetic 
energy and the average self-consistent potential energy are the same 
for all of these, so what we need to minimize is the Coulomb energy 
e?/r,, between the two electrons. This is achieved by keeping the 
electrons as far apart as possible, i.e. by having one electron in a 
(zz) orbital and the other one in a %(7,) orbital since each orbital 
is zero where the other is at its maximum. Also as explained in 
§ 12, the antisymmetry of the wave function keeps electrons of 
parallel spin apart, and in this way the energy can be lowered still 
further by choosing the spins parallel. The total wave function is 
therefore the determinant based on 


oo(rrxs Ty), b(ay; rg) t+. (22.142) 
Since inside a determinant we can take linear combinations of 
the columns without altering the value of the determinant, we can 
equally well use 

Yom 45 Ty )Uas P(a_; a)+o, (22.14b) 


in the notation of (22.12). From the latter form, the total wave 
function Y is seen to be invariant under rotations so that we have a 
& term. Since both orbitals are even under inversion, ¥ is also 
even (g). In (22.14a), %(az) and (zy), respectively, do and do not 
change sign under m,, so that the product changes sign. We there- 
fore have a X,~ term. Since the two electron spins are parallel, 
the term has a threefold spin degeneracy corresponding to S = 1. 
We therefore write it as 52,- with 2S + 1 as a pre-superscript as 
in the case of atomic terms. This resultant spin gives the oxygen 
molecule a magnetic moment and thus accounts for the paramagnet- 
ism of oxygen. 


Separating the atoms 


We consider a molecule in some definite state, and imagine the 
nuclei of the two atoms to be pulled further and further apart while 
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retaining the same symmetry throughout. This process can be 
followed spectroscopically, because as the molecule is excited into 
higher vibrational states, the average interatomic distance increases 
due to the asymmetrical form of the interatomic potential (Fig. 20). 


Ue passers 


Fie. 20. Potential energy U of two atoms as a function of their 

separation a. U, and a, are the equilibrium values. For an energy 

U,, the atoms oscillate between the points A and B on the curve, 
the mean separation a, being larger than ay. 


As the separation tends to infinity, the wave function tends to a 
combination of atomic functions like (22.10) and the energy to a 
sum of the corresponding atomic energy levels. The question is, 
which atomic orbitals and atomic energy levels? This can be 
answered group theoretically. 

Let the molecular wave function transform sceording to the 
representation D‘), and let a be a parameter proportional to the 
interatomic distances. Then the wave function Y% must vary con- 
tinuously with a and hence its irreducible representation cannot 
suddenly change. As we shall prove presently, its energy E(a) 
may cross the energy E®(a) of another level for some value of a 
only if D™ and D‘* are not equivalent. Thus we write down all 
the irreducible representations of the energy levels of the molecule 
in order of increasing energy in one column, and the same for the 
set of separated atoms in a parallel column. The n state belong- 

ing to D@ in one column must then change continuously into the 
n'1 state belonging to D™ in the other column (Fig. 21). 

Before going on to discuss a specific example, it is important to 
prove the theorem used above, namely that two energy levels 
E,(a) and E,(a) do not cross one another if they belong to the same 
irreducible representation D®@. Let the corresponding wave 
functions be Y, and ¥,, and let us suppose that H, and E, are close 
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Molecule Seporated atoms 


Fie. 21. Variation of energy levels FE with a. The labels a, B, y,... 
refer to the irreducible representations D(«), etc., of the symmetry 
group of the molecule. 


together but not equal for some value say a’ of a. Small variations 
of a and a’ can be treated by perturbation theory and we shall 
investigate whether we can expect the energy E,(a) — H,(a) to 
vanish for some value of a near a’. Let the variation of the Hamil- 
tonian be 4#(a). The interaction between the levels Y, and ¥, 
cannot be treated by ordinary non-degenerate perturbation theory, 
and we need to consider the eigenvalues of the 2 x 2 matrix} 


A Cc 
me ei (22.15) 


However, the interaction with all the other levels can be included 
in this matrix in the spirit of the spin-Hamiltonian formalism of 
§ 18 by putting 
= (1|44|n><n|4#|1> 
A = (1497]1> + tg 


C = (1|49#|2> + > Cee 


¢ For an exposition of the matrix formulation of quantum mechanics, see 
Schiff 1955. 
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with an analogous expression for B (Pryce 1950). Since this in 
principle includes up to infinite order of perturbation, we expect 
(22.15) to give a good picture of the energy levels for some finite 
range of a. We only neglect #,— EH, compared with Hy, — E,, 
and this error can be made arbitrarily small by taking a’ arbitrarily 
close to the assumed point of degeneracy. The difference in energy 
between the two levels is given by 


(H, — E,)? = (A — B)* + 4€C*, 
and this can only vanish if 
f(a) = A(a) — Ba) = 0, (22.16a) 
and g(a) = Cia) = 0. (22.16b) 


If Y, and Y, belong to the same representation, then (22.16a) and 
(22.16b) are two independent equations (actually three if we note 
that C is in general complex), and it is vanishingly improbable that 
both should be satisfied for a single value of a. We conclude there- 
fore that the energy levels H,(a) and £,(a) will not cross. If, however, 
YW, and ¥, belong to different irreducible representations, then 
(22.16b) is automatically satisfied by the fundamental theorem of 
§ 13. Thus only the one equation (22.16a) need be considered, and 
it is therefore quite possible that this gives a zero energy separation 
for some values of a. This proves the theorem. Note that the 
theorem may not hold if # varies as a function of two or more 
arbitrary parameters a, b, c, ..., an example of which is given 
in problem 26.8. 

By way of a detailed example, we shall consider the ground 
state of HCl. The first step is to decide what the symmetry of the 
ground state is. The configuration of a neutral chlorine atom is 
(1s)*(2s)?(2p)8(3s)?(3p)5. Of the 3s and 3p atomic orbitals, the 
3pz one or some hybrid of it with 3s will have maximum overlap 
with the hydrogen 1s function. Thus we shall put four 3p electrons 
in the spin-orbitals ¥(3pz)u,, ¥(3py)u,, and the remaining two 
electrons in the covalent bonding spin-orbitals 


[¥(8p2:r — Rei) + P(ls:r — Ry) ]uy. (22.17) 


This gives a total symmetry of 1X+. So far we have assumed that 
HCI is covalently bonded. We might alternatively assume it is 
ionic H+Cl-, analogously to Na*+Cl-. In this case the proton has no 
electron on it, and the chlorine atom has a closed shell (3p). The 
symmetry is therefore again 12+, and the ground state may be 
considered to be a linear combination of the two configurations. 
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Measurements of the dipole moment of the molecule show that it 
can be regarded as about 83% covalent and 17% ionic. This illus- 
trates how one can have intermediate types of bonds by writing the 
wave function of the molecule as a linear combination of several 
wave functions corresponding to several different simple structures. 
This of course can only be done when the separate wave functions 
all have identical symmetry, e.g. 12+ in the present case, since the 
whole wave function must belong to some definite symmetry type. 

We now consider the states of the separated atoms. From Table 
6, the atomic states are hydrogen 28 and chlorine ?P. There are 
therefore 2 x 6 = 12 independent wave functions for the pair. 
We write these as 


> (£1) Poi(M1, Ms = +)$u(M1 = 0, Ms = +), 
abbreviated to (Mz, +)(0, +), 


where 2(+1)P is the antisymmetrizing operator. We can group 
these into terms by following Slater’s procedure (see Table 5), and 
obtain the terms 7/7, 32'+, WI, 12+ as shown in Table 20. These 


TABLE 20 
Lowest Terms for Two Separated Chlorine and Hydrogen 
Atoms 
Atomic states Total Total Termt 
cl H M, M, 
(1 +) (0 +) 1 1 a 
(1 —) (0 +) 1 0 a 
(1 +) (0 —) 1 0 b 
(0 +) (0 +) rt) 1 ° 
(0 +) (0 —) 0 0 c 
(0 —) (0 +) 0 0 d 


t(a) 77; (b) 477; (c) #2; (d) 12+, 
The M, refers to the representation for rotations about the line joining the 
atoms. All the M, = 0 functions are even under the reflection mz. 


all have the same energy in the limit of infinite separation. One of 
these is a 12'+ state, the same as the state of the molecule, and since 
we are dealing with the lowest energy states these join up in the 
sense of Fig. 21. That is, the molecular state goes over into a 
state of the separated atoms which leaves each atom in its lowest 
state. This therefore gives the limiting energy of E(a) as a — oo. 
It should be noted that if 12+ had not occurred among the terms 
found from the ground states of the atoms, then as a —> 00 one or 
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more of the atoms would have been left in an excited state. An 
example of the latter situation is the methane molecule (problem 
22.14). 

Clearly similar arguments can be applied to the way the one- 
electron orbitals change with interatomic distance, and we can 
also investigate what happens in the limit a > 0 as the nuclei of 
the two atoms coalesce into a new single nucleus. 


Spin-orbit coupling 

So far we have based our discussion on the Hamiltonian (22.1) 
without spin-orbit coupling, and all symmetry considerations have 
referred to the orbital part of the wave function only. However, if 
the spin-orbit coupling is important, the Hamiltonian is not invar- 
iant under orbital transformations alone, and we have to apply the 
symmetry transformations to the orbital and spin variables simul- 
taneously. As with the full rotation group, the spin functions 
u,, uw. transform according to double-valued representations, and 

thus so do all spin-orbitals. It has already been shown in § 16 how 
' to work out these double-valued representations of the point- 
groups, and we shall not consider the matter further. As mentioned 
there, they have been tabulated for all crystallographic point- 
groups by Koster (1957). 


Further references 


The whole subject of the quantum mechanical basis of molecular 
structure is covered in rather greater detail than here by Eyring, 
Walter and Kimball (1944). Pauling (1939) and Coulson (1952) 
apply the quantum mechanical ideas to give a description of the 
chemical properties of a wide range of compounds. Van der Waerden 
(1932) discusses the spectra and symmetries of diatomic molecules 
in great detail. Coulson and Longuet-Higgins (1947-48) treat the 
chemical properties of conjugated hydrocarbons in terms of molecular 
orbitals. Roothan (1951) reviews some aspects of molecular orbital 
theory. Boys, Cook, Reeves and Shavitt (1956) show how one can 
calculate the properties of simple molecules quantitatively from 
first principles by systematically approximating to the exact wave 
function by a linear combination of many determinants. Mott and 
Stevens (1957) discuss the use of localized versus extended orbitals 
in solids. 


Summary 


We have discussed the different types of orbitals, atomic, directed 
valence, valence bond, molecular and Bloch orbitals, and in what 
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physical situations they are used to describe the electrons in molecules 
and solids. Their symmetry properties have been used to describe, 
classify, and enumerate them. We have also considered the sym- 
metry of the whole molecular wave function, and how this affects 
the way the energy changes as the molecule is pulled apart into 
single atoms. 


PROBLEMS 


22.1 Calculate the kinetic energies of the orbitals yp and pa 
(22.4), and hence show that %p has the lower kinetic energy. 

22.2 Show that a set of three directed valence orbitals making 
120° angles with one another in the 2, y-plane (Fig. 17b), can be 
constructed out of (2s), %(2pz), 4(2py) atomic orbitals. 

22.3 Show that the following are four directed valence orbitals 
making 90° angles with one another in a plane (Fig. 17a): 


a = (2s) + Y(2pz), ow = (28) — H(2pz), 
Yo = £28) ++ f(2py), op = (2s) — Y(2py). 


Why cannot these be used to describe the four valences of carbon? 

22.4 Show that six octahedrally directed valence orbitals can 
be formed from the set of atomic orbitals (3d)?4s(4p). Hence dis- 
cuss the electronic structure of the octahedral, covalently bonded, 
complex ion [Co(NH3).]*+++, and show that it must be diamagnetic, 
in contrast to the paramagnetic moment that one would expect 
a Cott+ ion to have (Table 15). The nickelous ion Nit+ forms 
complexes with four covalent bonds, sometimes at 90° to one another 
in a plane (Fig. 17a) and sometimes tetrahedrally directed like the 
carbon valences. What would you expect the magnetic moments 
of the two types of complex to be (Pauling 1939, p. 111)? 

22.5 Calculate the kinetic energies of the molecular orbitals 
(22.9) compared with that of a simple two-atom 7 bonding orbital. 

22.6 Referring to Fig. 19 and the discussion below (22.9) in the 
text, suppose that the hydrogen atom on carbon atom number 6 
has been replaced by a radical which attracts the electrons of the 
benzene ring towards this atom. Treat this effect as a perturbation 
and show that the occupied orbitals have the form 


by = Do + other terms, de = Pay as bis 
p, = cos a(,, + #1) + sin «(42 + Y_2) + other terms, 


where for the roughest picture we may neglect the “other terms”. 
Hence calculate the charge density on all the carbon atoms and show 
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that the electrons are removed preferentially from the ortho and 
para positions (carbon atoms 1, 3 and 5). Check this conclusion 
by taking into account all the first order perturbation terms in the 
orbitals. 

22.7 Let the wave function of the hydrogen molecule for all 
values of the interatomic distance RF be written 


] 
Y = cos « Puro + sin « —= (Wa0,1 = Pa0,2)- 


/2 

Here yo is the molecular orbital function (22.2), ¥ao,1 the atomic 
orbital function (22.10), and Yao,2 the same with u, and u_ inter- 
changed, each function being suitably normalized. Choose « to 
minimize the expectation value of the energy (22.1), and hence 
show that «-—>0 for small R and «— 90° for large R. (Coulson 
and Fisher 1949.) Note: the linear combination Wao1— Yao,e 
is required to give an S = 0 function: the corresponding S = 1 
functions refer to excited states of the molecule. 

22.8 The radii of the 4f and 6s shells of the gadolinium atom 
are about 0.3 and 2 Angstrom units. In gadolinium metal, the 
distance between nearest neighbour atoms is 3.6 A.U. What types 
of orbital would you expect to use to describe the magnetic 4f 
electrons and the conduction 6s, 6p electrons in the metal res- 
pectively? 

22.9* What types of wave function are commonly used for 
the electrons in pure solids? Illustrate your answer by particular 
reference to sodium metal, sodium chloride, diamond, solid hydro- 
gen, iron, nickel. References: Seitz 1940, Hall 1952, Mott and 
Stevens 1957, Heine 1958. 

22.10 Prove the following theorem: if a single determinant 
Y = |det (ri, o2)| of orbitals y~; describes a particular non- 
degenerate state of a molecule, then the 4; form a complete basis 
for a representation of the symmetry group © of the molecule. 
Hint: (a) show that TY = cz where T is any element of © and 
cr a constant; (b) suppose that the y¥, have to be supplemented by 
a few more functions ¢, to make up an invariant vector space. 
Show that this would imply that 7’; must include some ¢, for 
some 7' and some y¥;, and hence that it leads to a contradiction 
to (a). 

22.11 Construct a simple function of the variables r, and r, 
that transforms according to the irreducible representation 2- 
of the group oo m. 

22.12 Valence bond orbitals are formed out of the following 
pairs of atomic orbitals on two neighbouring atoms: (a) 2s-2p, 
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(b) 2s-2pz (c) 3d-3d. What are the irreducible representations of 
these valence bond orbitals? Consider both cases of like and unlike 
atoms. 

22.13 Using Slater’s procedure (see Tables 5 and 20), show that 
the lowest configuration (22.13) of the oxygen molecule gives rise 
to the three terms °2-, 12,+, 14,. In what order do you expect 
their energies to lie? 

22.14 Consider the ground state of the methane molecule, and 
suppose that the internuclear distances are being gradually in- 
creased to infinity while retaining the symmetry of the molecule, 
until the wave function describes a set of five separate atoms. What 
atomic states (terms) will the atoms be left in? Are these the ground 
states for the atoms? 

22.15 A bonding and an anti-bonding molecular orbital are 
formed from the 2s functions on two identical nuclei a long distance 
a apart. What functions do the two molecular orbitals change into 
as a@ is decreased continuously to zero (Coulson 1952, p. 93)? 


23. Molecular Vibrations 


In the last section, we used a model in which the nuclei of all 
the atoms in a molecule are fixed at positions R, = (X,, Y,, Z,). 
We showed how one could then discuss the motion of the electrons 
round the nuclei, and in principle one could obtain the electronic 
energy of the molecule to any desired degree of approximation. 
In this section we shall assume that this has been done, so that we 
start with a total energy H(R,) which clearly depends on what 
positions R, we assume for the nuclei. Then 


V(R,) = E(R,) — E(Ray) (23.1) 


is the potential energy of the system when the nuclei are displaced 
from their normal equilibrium positions R,, to an arbitrary position 
R,, though strictly this potential energy function V(R,) applies 
only to static displacements. However, molecular vibrations are 
in general so slow compared with the electronic motion, because 
the mass of the nuclei is so much greater than that of the electrons, 
that we may regard the vibrations as slowly varying semi-static 
displacements. In this section, therefore, we shall base our discussion 
of molecular vibrations on the assumption that the nuclei move 
subject only to the potential (23.1). For a quantum mechanical 
justification of this approach, we refer the reader to Eyring, Walter 
and Kimball (1944, p. 190). 
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Separation of co-ordinates 
The Hamiltonian for our system of vibrating nuclei is 


# =T + V(R,), (23.2) 
where T' is the kinetic energy | 
T = > ymNV2 = > (1/2m,)P.3, (23.3) 


and m,, V,, P, the mass, velocity, and momentum of the «'" nucleus. 
R,, V,, P, all refer to a system of co-ordinates OX, OY, OZ which 
is fixed in the laboratory. However this is rather an inconvenient 
system of co-ordinates to use. If we picture the movement of a 
molecule, we naturally think of this movement as made up out of 
three components; a sideways motion of the whole molecule (trans- 
lation), a rotation of the molecule about itself as a rigid whole, and 
an internal vibration of the nuclei relative to one another. It is 
therefore convenient to define a new system of co-ordinates. We 
use three co-ordinates (Xem, Yem, Zem) for the centre of mass 


Rem = (1/M)2Xm,R., M = LM 


and any variation of Rem means the molecule is moving sideways 
as a whole. We next define the positions of the nuclei by their 
co-ordinates 


Rh Rs (23.4) 


relative to the centre of mass. Then we redescribe the R’, further 
in terms of a frame of reference Ox, Oy, Oz which is rigidly fixed to 
the equilibrium configuration of the molecule. The orientation of 
this frame of reference, and of the molecule as a whole, is specified 
by three angles 6; (i = 1, 2, 3) with respect to the laboratory 
axes OX, OY, OZ. The vibrations of the molecule are then described 
in terms of the displacements 


§,vib = r, — Tap (23.5) 


of the nuclei from their equilibrium positions r,) in the rotating 
Ox, Oy, Oz frame of reference. The 6,”!” are still not our final co- 
ordinates because there are too many of them so that they are not 
linearly independent. To specify the positions of all the N nuclei, 
we need 3N co-ordinates. Six of these are Xem, Yem, Zem, and the 
three angles 6;. We therefore choose any 3N-6 independent linear 
combinations of the displacements §,¥!>, which we shall refer to as 
the vibrational co-ordinates Q, (8 = 1 to 3N-6). . 
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We can also divide up the total Hamiltonian (23.2) corresponding 
to the new set of co-ordinates. We first expand the potential energy 
as 

V(Qz) = dayQeQy + higher terms, (8, y summed) (23.6) 


where we may neglect all higher terms than the second because 
for the present we shall discuss only small, simple harmonic vibra- 
tions.t The vibrational velocities of the nuclei can be specified by 
the co-ordinates Q, = dQ,/dt. We can now write the total Hamil- 
tonian for the nuclei as 


H = HA om 5 H rot + H vin; (23.7) 
KH om = (1/21)Pem?, 


Lz? - Ly? L? 
22, 2Qy, 222’ (23.8) 


H rot = 


H yin = Ca, 00, + Ip QeQ,- (23.9) 


Here #cm describes the sideways motions of the whole molecule. 
H ro, is the kinetic energy of rotation, where Ox, Oy, Oz have been 
chosen along the principal axes of inertia of the molecule. The 
Jy are the corresponding components of the angular momentum, 
and Q,, Qy, Qz the principal moments of inertia. yp is the remain- 
ing vibrational energy, consisting of a kinetic term in the vibrational 
velocities and the potential term (23.6). 

Actually this division (23.7) of the Hamiltonian is not quite 
correct. If we contemplate a gross rearrangement of our molecule 
such that the nuclei are moved far from their original positions 
in arbitrary directions, we come to the conclusion that there is no 
really unique way of dividing this displacement of the nuclei into 
so much pure rotation and so much vibrational displacement. 
The division is only unique in a first order of approximation for 
small distortions. Thus the Hamiltonian (23.7) contains terms that 
depend in fact on both the 6 and the Q,: for example the mean 
moments of inertia Q;, Qy, Q, are not really constants but are 
effected by large vibrations, and the vibrations can contribute to 


{ The linear and constant terms in (23.6) are of course absent because by 
definition the position Qg = 0, for all £, gives a potential energy minimum 
V(0) = 0. Also V is independent of Rom and 6 since physically the potential 
energy of the molecule depends on its internal configuration only and not 
on its position or orientation relative to the laboratory OX, OY, OZ axes. 


16 
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the angular momentum L if we have two degenerate modes as 
in a conical pendulum. References to further discussions of these 
difficulties are given at the end of this section. 


Normal co-ordinates 


The vibrational Hamiltonian (23.9) can be brought into a more 
convenient form by using a new set of co-ordinates 


da=NpQ, 4d =Nap,Q,, (23.10) 


where for the moment N is an unspecified matrix. With these 
co-ordinates the Hamiltonian becomes 


Hoy = (NON) od, + (N--DN-)p,ge, (23.11) 


Now there is an important theorem in matrix algebra (Margenau 
and Murphy 1943, p. 311) which states that a matrix N can be 
found which satisfies the two conditions 


1 


N-10N-1=4}. . 1 sis 


N-DN-2=4] 2. as? 


2 
W3N-6 


Indeed there exist systematic methods for finding N as we shall 
see later. The corresponding co-ordinates g, defined by (23.10) are 
called normal co-ordinates, and we have from (23.11) 


Hvin = 4 2 (de* + ~p"Gp"). 
The conjugate normal momenta are 


ar. 
Pe = 34, = te (23.12) 
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so that we finally obtain # yi» in the form required in quantum 
mechanics, namely in terms of co-ordinates and conjugate momenta 
only 


AH vip = 2 (Pp + «g79g") 
= Tyin + V. 


(23.13) 


Incidentally we notice that the we are all real (wg? positive) since 
the potential energy for any distortion is positive. Each bracket 


Hp? +o%@?) (23.14) 


is the Hamiltonian for a simple harmonic oscillator, and the quantum 
mechanical energy levels aret+ 


Ey = (n+ })hiw. 


Thus the energy levels and eigenstates of #yip can be designated 
by a set of 3N-6 numbers ng (positive integers or zero). . 


BG Re eS S (tp + Ahenp (23.15) 


Symmetry transformations 


We now consider the group 6 of symmetry transformations of 
the molecule when all the nuclei are in their equilibrium positions. 
By this we do not mean the rotation of the molecule in X, Y, Z 
space, which has already been described by the rotational co- 
ordinates 6; We mean the symmetry transformations of the 
molecule (rotations, reflections, inversion) in the x, y, z space, for 
instance the point-group of transformations 6m2 which was dis- 
cussed for the ozone molecule in § 4. Let the equilibrium positions 
of the three oxygen atoms in the ozone molecule shown in Fig. 
22a be given by some fixed co-ordinates in the 2, y, z space. Then 
the symmetry operation consisting of a 180° rotation about the 
y-axis interchanges the nuclei 2 and 3, producing a new position 
of the molecule (Fig. 22b) which is indistinguishable from the 
original position. Consider now a distorted molecule, e.g. that 
shown in Fig. 22c where one atom has been displaced. After rota- 
tion by 180° about Oy, the molecule looks like Fig. 22d, and we 
can consider this as a new distortion of the original molecule. We 
say, therefore, that the distortion of Fig. 22c has been transformed 


{ The energy eigenvalues and eigenfunctions for a simple harmonic oscil- 
lator are derived in appendix G. 
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(a) {b) 


Yh Y, 

(c) (d) 
Fie. 22, Rotation of an ozone molecule by 180° about the y-axis: 
(a) and (b), two equilibrium positions of the molecule, related by the 


rotations and physically identical: (c) and (d), two distortions of 
the molecule related by the rotation. 


AAS 


Fig. 23. Vibrational normal co-ordinates of the ozone molecule, 
(a) transforming according to the irreducible representation A’,, 
and (b) and (c) according to EZ’. 


by the rotation into the distortion of Fig. 22d. In this sense dis- 
tortions can be transformed just like functions f(x, y, z) can, and 
we can have distortions that transform according to definite re- 
presentations. For example Fig. 23c shows a distortion which is 
invariant under a 180° rotation about Oy, whereas the distortion 
of Fig. 23b changes sign. The distortion of Fig. 23a is invariant 
under any transformation of the point-group 6m2. 

From (23.13), the potential energy (23.1) is given in terms of 
normal co-ordinates by 


V =} > wp2gg?. (23.16) 


This must have the same value for two distortions such as Figs. 22c 
and 22d which are related by a symmetry transformation. That 
is, V(qg) is invariant under the group 6. If a particular we is not 
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equal to any other w,, then from (23.16) the invariance of V implies 
the invariance of gg". Thus gg has to transform according to a one- 
dimensional representation in which every element is represented 
by +1 or —1, or what is the same thing, by a real one-dimensional 
representation (problem 23.3). If two or three w,’s are equal, say 
Wg = Wgi1 = Wey, then we require the invariance of 


cog?(Qa" + Yara” + Yei2”)- (23.17) 


It is clearly necessary that ¢g, Ya.1, Ye transform only among 
themselves under ©, i.e. form the basis of a representation. This 
representation must in general be irreducible for the following 
reason. Suppose the representation were reducible, say into a one- 
dimensional representation (base vector 9.) and a two-dimensional 
component (base vectors 911, 42). Then from problem C.4 
(appendix C), we have that 


Agg? + BGos1? + Fa+2”) 

is invariant, where A = &,” and B = dg,,? = Bg,,? are arbitrary 
constants. It is not impossible that A and B are accidentally equal, 
but the probability of this happening exactly is zero. We there- 
fore conclude that 9g, 9e11, 712 in (23.17) transform according to an 
irreducible representation, accidental degeneracies excepted as 
usual. The frequency wg, and the co-ordinates Ye> 18419 Tare are said 
to be three-fold degenerate. More generally, a frequency is said 
to be n-fold degenerate if the corresponding co-ordinates transform 
according to an n-dimensional irreducible representation. 

The case of complex representations is somewhat anomalous. 
Let D and D* be two complex conjugate representations, e.g. the 
two representations of the point-groups 4 which together are labelled 
# in appendix K. Such complex conjugate representations always 
have to occur in pairs with equal values of we. We can see this in 
two ways. If q transforms according to a complex representation 
D under real rotations, ete., then g must itself be complex. To des- 
cribe the molecule by real co-ordinates, we must use 


h=9t, G=q-—4qQ. 
Here g* transforms according to the complex conjugate representa- 
tion D*, so that g, and g, are not invariant but transform into one 
another under G. Thus they must occur together in (23.16) with 
equal values of wg. Another way of looking at this is from the point 
of view of time-reversal. It was shown in § 19 that states transform- 
ing according to D and D* are degenerate. The complex co- 
ordinate q describes a travelling wave, rather than a standing 
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wave oscillation as a real co-ordinate does. For instance g may 
represent a compression travelling round a benzene ring. The 
co-ordinate g* then refers to the same compression travelling in the 
reverse direction around the ring. Clearly if one is a possible motion 
of the molecule, then so is the other, and they have the same 
frequency wg. It is for this reason that a pair of complex conjugate 
representations in appendix K have always been bracketed together 
and regarded as a single two-dimensional representation. 

From (23.12), pg transforms in the same way as gz, so that in 
all of the above cases the kinetic energy T’ = > pg,” remains invari- 
ant. Thus the whole vibrational Hamiltonian is invariant under the 
group © of symmetry transformations. 


Enumeration of normal modes 


We shall now show how the normal co-ordinates of vibration 
can be enumerated, by discussing the specific example of ozone. 
An arbitrary distortion or displacement of the molecule can be 
specified by the nine quantities 5;,, 5y., 5., (« =1, 2, 3) where 
5, is the displacement of nucleus « from its equilibrium position. 
These quantities form the basis of a representation Dio of the 
symmetry group 6m2 of the molecule, whose characters can easily 
be written down. From Fig. 22 a rotation of 180° about Oy turns 
the initial distortion 5,, into a new distortion 6’;, given by 


8’ a1 = — $x, 8 yt = by, on = — 8a, 
8’ x2 — — dz, 8’ y2 = 8y3; 8'29 = — 8:23, 
8’ 23 = — dz2, 8’y3 = bye, 8'23 eae — 822. 


This rotation is represented therefore by the matrix shown in 
Table 21, and the character (sum of diagonal terms) is —1. Actually 


TaBe 21 
The Matrix Representing the Rotation by 180° about the 
y-axis 

bay yn, 8.1 Bz 8y2 3,0 373 3y3 325 
se, | —1 
sn 1 
Ba —1 
5 22 e —1 
80 ' 1 
8 : —1 
Bins aa . : 
3’ys 1 * 
85 . vel . : 
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there is no need to go through all this business of writing down the 
matrices. For any transformations, only those nuclei can contribute 
to the character ‘which are transformed into themselves, i.e. not inter- 
changed with some other nucleus. Thus the 120° rotation about 
Oz has character zero because every nucleus is shifted round to a 
new position in the molecule. Moreover from (2.2), each nucleus 
which 18 transformed into itself contributes +(1 +2 cos 8) to the 
character, where the plus and minus refer to proper and improper 
rotations respectively. The different classes of symmetry elements 
of the group 6m2 are shown in Table 22, and applying the above 
two rules we have 


x(B) = 9, x(mz) — 3, x(32) cad 0, 
x(6z) =0, = x(2y) =—1, —_x(mz) = 1. 


TABLE 22 
Character Table of 6m2 


6m2; £ Mz 32 6. 2y Mz 
A’) 1 1 1 1 1 1 x? + y?; 2% 
A’, I 1 1 1 —1 —1 I; 
AM. At - eel —_—— | . 1 
A’, 1 —1 1 —1 —1 1 z 
E 2 2 —l —l 0 0 xy a? — y?, xy 
EY” 2 —2 —l 1 0 0 Iz, Ty Y2, 20 


Using Table 22 and equation (14.2) or (14.10), we can reduce this 
representation, and find 


Diop = A’, + A’, + A”, + 2E’ + EB" (23.18) 


Since we started by considering an arbitrary displacement of the 
molecule, Dtot includes besides the rotational co-ordinates also the 
representations of the translational and rotational co-ordinates. 
The latter we now want to subtract out. The translational dis- 
placements Xem, Yem, Zem transform into one another like the 
functions x, y, z. This can be seen pictorially from Fig. 24a for 
instance, which shows 8Xem. Thus from Table 22 5Zem belongs 
to the representation A”,, and 5Xem, 5Yem to EL’: ice. 


Derans = A", + E’. 
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Turning to the rotations, we have from (8.2) ‘that a small rotation 
R(@, z) induces the displacements 


§rot? — i6Lr, 


of the nuclei from the original positions. The set of equilibrium 
positions r,, of all the atoms is invariant under 6. Therefore if we 
write R, for the set of displacements 52°, we have that Rz, Ry, 
R;. transform in the same way as Iz, Iy, Iz. The latter are the com- 
ponents of a pseudo-vector, i.e. they transform like x, y, z under 


Yh 


_-—> Xx 


(a) 


Fig. 24. Displacements of the ozone molecule; (a) the translational 
displacement 8Xcm, (b) the rotational displacement Fz. 


proper rotations but remain invariant under inversion. This was 
proved in appendix F and § 11. Thus the characters of the repre- 
sentation formed by Rz, Ry, Rz or Iz, Iy, Iz are given by 


x(of Iz, Iy, Iz) = x(of x, y, z) for proper rotations, 


== —x(of x, y, 2) for improper rotations. 


(23.19) 


This is illustrated for instance by the representations A’, and A”, 
(Table 22) appropriate to IJ, and z. The rotational displacement 
R, is shown in Fig. 24b, and this also makes it clear that R, 
transforms according to A’,. Thus from Table 22 we have for the 
group 6m2 

Drot — A’, + x’. 


Now subtracting Dtrang and Drot from (23.18), we obtain finally 
Dyiyp = A’, + EH’. (23.20) 


The ozone molecule therefore has one non-degenerate vibrational 
frequency (A’,) and one doubly degenerate frequency (Z£’). The 
ymodes of vibration are shown in Fig. 23. 

The normal mode A’, can easily be written down by inspection 
(Fig. 23a), and so can the other two with a little practice. But 
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the projection operators (14.11) enable one to obtain any mode 
systematically. We recall that if 5 represents symbolically an 
arbitrary displacement, then 


q=> (PTS (23.21) 
T 


transforms according to the irreducible representation D”, and 
in particular 


G = > Dy*(T)TS 
T 


according to the i" row of that representation. In order to use 
these formulae, we need to represent a displacement mathematically. 
There are two ways, both of which we have already used to some 
extent. The first method is to represent a displacement by the nine 
co-ordinates (in the case of ozone) 5,,, 5y,, 5,, or by some linear 
combination of them. For example 


8Xem = (S21 + 5x2 + 523) (23.22) 


clearly measures the X displacement of the centre of mass. In 
(23.21) we can put 5 equal to any linear combination of the 5,,, 
78 then being obtained from Table 21 plus the corresponding 
matrices for the whole group. Choosing D” as the representation 
A’,, it can easily be found in this way that the corresponding co- 
ordinate is 


GAD) = By, — $8y2 — $4/38z2 — $8y3 + $4/3523. (23.23) 


What does this mean? It means that if the arbitrary displacement 
described by the 5,, is decomposed into a translation, a rotation, 
and a sum of normal vibrational distortions, then the amplitude of 
the A’, mode (Fig. 23a) is given by (23.23). The second method 
of representing a normal co-ordinate is a more pictorial one, and 
is obtained if we associate with each nucleus a displacement vector 
such as in Figs. 22 to 24. Starting with an arbitrary set of displace, 
ment vectors and applying (23.21), we easily obtain Fig. 23a. The 
relationship between the two ways of representing a normal co- 
ordinate is the following. Fig. 23a represents such a displacement of 
the molecule, that the value of q(A’,) (23.23) is non-zero while the 
values of all the other q’s are zero. Fig. 24a and (23.22) are similarly 
related. A little difficulty arises when a given representation occurs 
more than once in Dio. For instance when we apply the construc- 
tion (23.21) to the representation EH’, we will in general obtain a 
linear combination of the vibrational mode, Fig. 23b or c, and the 
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translation Xem or Yom, Fig. 24a, since each transforms according 
to E’. We then have to subtract out some 5Xem, 5Yem until there 
is no displacement of the centre of mass, in order to obtain a purely 
vibrational mode. 


Infra-red and Raman spectra 


The main discussion of infra-red and Raman spectra will be 
deferred till the next section, but if we neglect all overtones and 
combination frequencies, the situation is briefly as follows. A 
normal vibration gg gives an absorption line in the infra-red spectrum 
if de transforms in the same way as one of the dipole moment com- 
ponents fiz, by, bz. Further, dg gives a line in the Raman spectrum 
if it transforms in the same way as one of the polarizability tensor 
components o4j (or linear combination thereof). The dipole moment 
is an ordinary vector, and the components uz, by, bz transform like 
x, y, z. In the case of ozone this is according to Z’ + A”,. Hence 
the doubly-degenerate vibrational mode £’ of (23.20) is infra-red 
active, whereas the A’, mode is not. The polarizability ag 13 a 
symmetric tensor and hence the components transform in the same 
way as x*, y*, 22, xy, yz, 2x. We have already obtained the trans- 
formation properties of these quantities in (5.20) for the point- 
group 32, and from this they can be assigned by inspection to the 
representations of 6m2 as shown in Table 22. Thus any vibrational 
mode transforming according to A’,, H’ or Z” is Raman active, and 
this includes all of the vibrational modes (23.20) for ozone. 

The transformation properties of these quantities are sum- 
marized in Table 23. From this table together with the group 


TaBLE 23 


Transformation Properties of Some Often Used Quantities 


Transforms like 


translation of centre of x, y, 2 (vector) 
mass, Xeom; Yom; Zom 


Rotational displacements Iz, Iy, I, (pseudo-vector) 
Rz, Ry, R 2 


dipole moment, L,Y, 2 
Has foys fez 


polarizability tensor, aw, y3, 23, wy, yz, 20 
oy 
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character tables (appendix K), the symmetry species of the vibra- 
tional normal co-ordinates of any molecule can easily be derived 
by the method described above, and the infra-red and Raman 
active ones among them written down. 


Linear molecules 


The procedure for obtaining the normal vibrations of a linear 
molecule is the same as that given for ozone above, with one minor 
difference. If we choose the axis of the molecule as the z-axis and 
consider the molecule in its equilibrium state, then a rotation about 
Oz does not change the position of the molecule at all. We have 
therefore only two rotational co-ordinates 6; or Rz, Ry. This has to 
be remembered when subtracting Ryo; out from Dios. 


-——_——-o Ay mode 


<9 —_-—____>-—_—_4-6 A,, mode 


pt. Ey mode 


Fie. 25. Vibrational normal modes of the CO, molecule. The 
second #,,, mode is the same as the one shown above, but in the 
plane perpendicular to the page. 


The symmetry group of a linear molecule is either com or co/mm, 
depending on whether it has a centre of inversion symmetry or not. 
Consider for example carbon dioxide, which has symmetry co/mm. 
Applying our rules for the characters, we obtain for Dtot 


x(Z) = 9, xLR(4, 2)] = 6 cos $ + 3, x(2z) = —I, 
xl) = —3, = x[1R(¢,z)] = —2cosd—1, —_ x(mz) = 3, 


whence from the character table for oo/mm (appendix L) 
Dot = Arg + 2Aon + Fig + 2Ein. 


We subtract out the three translational components in accordance 
with Table 23 and the character table | 


Dtrans > Agu + Ea, 
and the two rotational co-ordinates 


Drot = Eig, 
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and obtain 
Dyin = Aig + Aoy + Fru. (23.24) 


There are therefore two non-degenerate modes (Aig and As,) and 
a pair of degenerate ones (Hi,). Of these Aig is Raman but not 
infra-red active, and the others are infra-red but not Raman active. 
The normal modes are pictured in Fig. 25. 


The ammonia inversion line 


At the beginning of this section we adopted the model that the 
electrons in a molecule move very rapidly around the nuclei, whose 
vibrations in comparison are slow semi-static displacements. 
The wave function corresponding to this model has the form 


F = Pereor(tiei Re) Pauci(Ra); (23.252) 


and it can indeed be shown that this is in general a very good 
approximation to the wave function. Here ¢elect is a wave function 
in the electronic co-ordinates rj, with the nuclear co-ordinates 
R, entering only as fixed parameters. The approximate separation 
(23.7) of the nuclear Hamiltonian further implies that Ynuci can be 
written approximately as 


¥ouci(R.) = Yerana(Rem) Prot(:) Pyin(Qp)- (23.25b) 


However, our argument so far implicitly assumes that the nuclei 
never penetrate far from a definite equilibrium configuration, as 
witnessed for instance by the fact that our discussion has been 
couched in terms of small displacements. If each nucleus departs 
arbitrarily from its position by something of the order of the inter- 
nuclear distance, then it is difficult to regard such a displacement as 
the sum of a rotation of the whole molecule and an internal dis- 
tortion, nor are the approximations (23.25) valid any more. 

For this reason, our model breaks down when a molecule can 
take up more than one distinct equilibrium configuration. An 
example of this is the ammonia molecule (Fig. 26) in which the 
nitrogen nucleus can either be above or below the place of the three 
hydrogen nuclei (position N, or N,). We would expect that the 
nitrogen nucleus has quite a large potential barrier to overcome in 
passing from one position to the other, so that this transition will 
be slow compared with the vibrational frequencies. We can there- 
fore write down two approximate wave function ¢puci,1 and Ynuel,s 
to describe the molecule vibrating about the two equilibrium 
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N 
H 
He = H 
fase / ae 
ao re 
No 


Fie, 26. The two possible positions N, and N, of the nitrogen 
nucleus in an ammonia molecule. 


positions respectively. However, the transition from N, to N, 
is a symmetry transformation for the system of nuclei (though 
it is not a symmetry of each equilibrium configuration individually), 
and the whole molecule wave function must be either the symmetric 
or antisymmetric combination 


Perect(Pnuel,1 + Ynuci,2). (23.26) 


Now the time ¢ for a transition from N, to N, is very long, and the 
energy difference 4H ~ h/t between the two states (23.26) is rather 
small. This has two consequences. Firstly, the transition N, to N, 
gives an extra spectral line, namely the so-called inversion line of 
ammonia which is in the microwave (1.25 cm) region of the spectrum, 
compared with the vibration frequencies which lie in the infra-red. 
This is the line used in the ammonia maser. Secondly, it means 
the rest of the vibrational spectrum is just that of one configura- 
tion alone, with possibly some fine structure of magnitude AE. 
Other examples of molecules with several equilibrium configura- 
tions occur among organic molecules with hindered rotation. For 
instance in CCl, the two C Cl, groups are not free to rotate about 
the central C—C bond because the big chlorine atoms get in one 
another’s way. However, the two groups can take up three different 
relative positions, in which the two sets of chlorine atoms interlock. 


References 


For the separation (23.252) between the electronic and nuclear 
motion, the reader is referred to Eyring, Walter and Kimball 
(1944, p. 190) and to the original paper by Born and Oppenheimer 
1927. The separation of the nuclear Hamiltonian into the centre 
of mass, the rotational and vibrational motions is discussed by 
Wilson, Decius and Cross (1955, Chapter 11) and by Bodi and 
Curtis (1956). The energy levels of the rotational Hamiltonian 
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are conveniently summarized by Wilson ef al. (1955, appendix 16), 
the results for a linear molecule being derived by Eyring et al. 
(1944, p. 72). The whole subject of molecular vibrations is very 
fully and clearly discussed in the book by Wilson, Decius and 
Cross (1955). Two shorter reviews having a scope similar to the 
present section are Rosenthal and Murphy (1936) and Sponer and 
Teller (1941). 


Summary 


The motion of the nuclei in a molecule can be represented approxi- 
mately as the sum of three parts; a sideways translation of the whole 
molecule, a rotation of the molecule as a whole, and internal vibra- 
tions or distortions of the molecule. The latter are expressed in 
terms of the vibrational normal modes which have different sym- 
metries. This symmetry determines whether or not a particular 
vibration gives a line in the infra-red spectrum and the Raman 
spectrum of the substance. 


PROBLEMS 


23.1 What are the symmetry groups for the following molecules: 
C,H,, H,, H,O, DHO (heavy water), NH,, CH,, benzene, cyclo- 
hexane? Note: H,O is not a linear molecule because of the lone 
pair electrons on the oxygen; benzene is planar. 

23.2 Enumerate the symmetries of the normal vibrations of the 
above molecules and illustrate some of them by figures. Determine 
which ones contribute as fundamentals to the infra-red and Raman 
spectra. 

23.3 Prove that every element of a real one-dimensional represen- 
tation is +1. Hint: use the unitary property of problem C.2. 

23.4 By looking at the character of m,, sort out the normal 
modes of benzene into those that are vibrations in the plane of the 
molecule and those perpendicular to it. 

23.5* In order to interpret the vibration spectra of a com- 
plicated molecule, one usually makes a theoretical calculation of 
the frequencies from a theoretical model, in which some force 
constants are assumed for the various inter-atomic bonds to repre- 
sent their resistance to stretching or bending. Give an account 
of how the normal modes and frequencies are calculated from the 
model, and in particular of how group theory can help in the cal- 
culation. Reference: Wilson, Decius and Cross (1955). 

23.6* Outline how the energy levels of a rigidly rotating molecule 
are derived (rigid rotor), distinguishing between the linear, spherical, 
symmetric and asymmetric cases. Obtain the degeneracies of the 
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levels, and selection rules for transitions between them. References: 
Casimir 1931; Wilson et al. 1955, appendix 16; Herzberg 1945. 

23.7* Show how to incorporate the nuclear spins (§ 21) into the 
wave function (23.25) of a whole molecule, and discuss their effect 
on the spectra of diatomic molecules with particular reference to 
H,, HD, D,, and O,!*. The nuclei of H, D, O1* have spins of 4, 
1, 0 respectively, and D stands for a deuterium atom (Eyring et 
al. 1944, p. 265). 


24. Infra-red and Raman Spectra 


In the last section the wave function for a whole molecule was 
written in the approximate form 


| P= Perect(Te3 R,) Pirans(Rem) Prot(94) tyip(Ya)- | (24.1) 


Here the vibrational wave function 


Pvin = Yyin(™1, Ne, - . - Msn_6e) (24.2a) 


corresponds to an excited state of the molecule with ng quanta 
of energy in the f" normal mode of vibration, the energy of the 
state being 

E(ng) = 2 (Mp + $)hwg. (24.2b) 
The existence of these vibrational excited states and the transitions 
between them give rise to various types of line spectra, of which 
we shall discuss the infra-red absorption spectrum and the Raman 
spectrum. 


Infra-red absorption spectrum _ 
If two energy levels H, and HF, of the molecule are separated by 


an amount 


then the molecule can absorb light of frequency w and make a 
transition from the state H, to #,. For light polarized in the X- 
direction, the transition probability, and hence the absorption 
coefficient of the bulk liquid or gas, is proportional to the modulus 
squared of the matrix element 


(2|Dx|1> = J W+Dx¥, dv. (24.4) 
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Analogously to the atomic case (13.7), Dx is the X component of 
the dipole moment operator 


D=—edr,+e>Z,R,, (24.5a) 


where the summation is over all the electrons (+) and nuclei («) 
Z,, being the atomic number. 

The formula (24.4) can be reduced using the wave functions 
(24.1). In almost all cases, the electronic energy levels of a molecule 
are separated by about 1 electron volt (10‘ cm—) or more, corre- 
sponding to visible and. ultra-violet frequencies. Infra-red light has 
insufficient energy to excite the electrons to above the ground state 
Yeo, 80 that in an infra-red transition we must have 


Yelects 1 = Pelects 2 = Peo- 
Turning now to the translational wave function, we have 
trans = exp (tk: Rom) 
for a freely moving molecule, so that 
trans(R, + A) = exp(tk: A)ytrans(R,). 


Since D is invariant under a translation A of the molecule, we have 
from the fundamental theorem (13.8b) that trans, and ¢transse 
must belong to the same translational representations for (24.4) 
to be non-zero, i.e. we must have k, = k,. We consider finally the 
rotational and vibrational co-ordinates, and refer D to the 2, y, 2 
co-ordinate axes fixed to the molecule as defined in the last section ; 


Dy = lx2zDz + lxyDy ae lxzDz, (24.5b) 


Here the lz; are the direction cosines of the x, y, 2 set of axes relative 
to the laboratory set of X, Y, Z, and are functions of the rotational 
co-ordinates 61 only. With these substitutions (24.4) becomes 


<2|Dx|1> = 2 | Prot, xsProt,1 dv | Hin,e 14 Prin, dY, 
, , (24.6) 


4(de) = [Poot (eesRy) Dj(roR,) poo(tsR,) dve- | (24.7) 


In (24.7) only the electron co-ordinates are integrated over. Physi- 
cally (7 = 2, y, z) is the value of the dipole moment of the molecule 
when the nuclei are in the positions specified by the gg or R,. 


where 
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The form (24.6) for the transition matrix element is particularly 
convenient for discussing selection rules. The first integral depends 
on the 6; only, and because of the factor lx,;(4) the transition ¥, 
to Y, can involve a change in the rotational as well as the vibrational 
state of the molecule simultaneously. However, the rotational 
quanta of energy are of the order of only 1 cm™~, whereas the 
vibrational frequencies are in the infra-red with iw ~ 10? to 104 
em-1!, Consequently the purely rotational transitions are seen 
more in the microwave region of the spectrum. In the infra-red, a 
transition involving a vibrational quantum can be accompanied by 
a& simultaneous rotational transition, so that the vibrational lines 
have a fine structure. This structure is determined purely by the 
first integral in (24.6) and we shall not consider it further. How- 
ever, we obtain a selection rule for whether the whole vibrational 
line is absent or present from whether the second integral of (24.6), 
namely the matrix element 


2|uj|1> = | type HS Yyin. In 
= | (hyine Pyipa)* Hy dp, (24.8) 


is zero or non-zero. From the matrix element theorem (13.8b), 
(24.8) can be non-zero only if the representation of p; is contained 
in the reduction of the representation D(Vi?.) x D(vib.*, More- 
over, as usual in such symmetry arguments, we can assume that the 
converse is also true; i.e. if the representation of py; is contained in 
Divib.2) x Divib.*, then we expect (24.8) to be non-zero (apart 
from accidental cases) as long as we have included all the symmetry 
properties of the system in the group & to which the representations 
refer. From (24.5a), the py transform like a vector in the same way 
as 2, y, z (Table 23), according to the representation D‘vect) gay. 
Also in (24.6) there is a summation over j, so that the transition 
probability is finite if at least one of the matrix elements <2|u;|1) 
is non-zero. We have therefore derived the general rule: 


there is an infra-red absorption line corresponding to 
the transition yin,1 to dyin, if and only wf the representa- 


tion D'vect) of one or more of the dipole moment components 


Hay bys ez 18 contained in the reduction of the representation 
D(vib.2d ye Dlvib.ty*, 


(24.9) 


We shall not discuss till later how to derive the transformation 
properties of the vibrational wave functions, but suppose for the 


17 
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sake of example that we are interested in the absorption spectrum 
of ozone, and that ¢vin,, and vin, transform according to the 
representations A’, and H’ of the symmetry group 6m2 (Table 22). 
The p; transform as a vector like x, y, z, ie. in the present case 
Ux fy according to the representation HZ’ and yz according to A”». 
Also the product yin,o¥vin,, transforms according to E’ x A’., 
which from (14.3) and the character Table 22 turns out to be equal 
to BE’. This contains the representation of uz, wy but not that of 
tz. Hence the matrix element <2|yz|1> is zero whereas <2|y2|1> 
and <2|uy{1> are non-zero, and from (24.6) the latter two are 
sufficient to guarantee an absorption line in the spectrum corre- 
sponding to the energy EH, — Hy. 


Raman spectrum 


In the Raman effect, one shines light of a fixed frequency a, 
usually from the mercury spectrum on to a substance, and one 
observes light of a different frequency w, emerging scattered by the 
substance. The molecules of the substance meanwhile make a 
vibrational and/or rotational transition Y, to Y,, the total energy 
of course being conserved 


hw, +- Ex, = lw, + E,. 


Thus it is the frequency shift (w, — w,) that corresponds to the 
energy difference ZH, — H,. The intensity I, of the scattered light 


is given by 
I, = (weS/e*)| <aay> [2h (24.10) 


Here a», is a component of the polarizability tensor of the molecule 
when the nuclei are in fixed positions R,. The suffices 1 and 2 refer 
to the directions X, Y or Z of polarization of the incident and 
emergent beams. As well as being derivable quantum mechanically 
as in problem 24.11, (24.10) is suggested by a semi-classical argument. 
The electric field &, of the incident beam induces a dipole moment 
in the molecule of strength 


B = a8. 


This oscillating dipole radiates energy at a rate proportional to 
p2, and |&,|? is proportional to the indicent intensity. Thus 


I, p® = ah? oc aT). 
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Since the molecule is itself vibrating with frequencies wg, the various 
combination frequencies 


We = W, + wa, etc. (24.11) 


occur among the radiated light. Just as in the direct optical absorp- 
tion theory the classical px gets replaced the quantum mechanical 
matrix element (24.6), so analogously a, gets replaced by 


<2lerz|1> = i} Prot,elrilsiProt,1 dv | Pride 4 Yvin,1 dv. 
(i, j summed) 
(24.12) 


As in the case of infra-red absorption, the first integral determines 
the purely rotational Raman effect and the rotational fine struc- 
ture of each vibrational Raman line, though these effects are only 
rarely observable in practice. The last integral in (24.12) determines 
the selection rules for the possible vibrational transitions. The 
oy are the components of a tensor and transform respectively like 
x*, y*, 2", ay, yz, ex. Analogously to (24.9), we obtain the rule: 


a possible vibrational transition vip, to Yvin,2 can give 
rise to a line in the Raman spectrum if and only if the 


representation D'tensor) of one or more of the polariza- 


bility component oj occurs in the reduction of 
Dvib.2) x Pivib.1)* 


(24.13) 


For example consider again the transition yip,1(A’,) to pyin,2(Z’) 
for ozone already discussed above. The representation D‘vib.®) 
x Divib.D* ig ; 

E’ x A’, a E' 


as before, and from Table 22 this is seen to be the same as the repre- 
sentation formed by the components azz — ayy, «zy. The Raman 
spectrum therefore contains a line due to this transition. 


Fundamental frequencies 


The selection rules (24.9) and (24.13) are completely general. 
To apply them, we would first have to work out the irreducible 
representations and energies of all the vibrational states of the 
molecule, and then start testing all possible pairs of states to see 
whether the transition between them is an allowed one. While 
this may be a useful thing to do in the later stages of analysing a 
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spectrum to understand all its detail, it is clearly unsuited to giving 
a quick overall idea of what kind of spectrum to expect from any 
given molecule. It is too tedious, and does not distinguish between 
strong lines and ones too weak to observe. Moreover the selection 
rules (24.9) and (24.13) are concerned with the initial and final 
states and not at all with the energy difference between them. For 
instance a transition from a state with ng = 6 to one with ng = 7 
gives the same spectral line as a transition from ng = 2 to ng = 3, 
or indeed as any transition involving an energy difference of one 
quantum hw,g. The above selection rules can tell us individually 
which of these transitions are allowed, but the argument we shall 
now develop will tell us whether the line as such is allowed, i.e. 
whether or not there are any allowed transitions with a given energy 
difference. Furthermore the allowed lines will be grouped into sets 
in approximately decreasing order of intensity. 

The dipole moment pz, py, pz (24.7) is a function of the internal 
co-ordinates g, only, and we can expand it (using the summation 
convention) as 


Op4 re] 
My = bye + Oe (#),+ oe (nae 0 cis 


+ higher terms. 


Here the suffix zero refers to the equilibrium position of the nuclei. 
The first term pj, is the permanent dipole moment of the molecule, 
and from (24.6) we see that it determines the intensity of the purely 
rotational transitions. We can tell whether pj, is zero or not as 
follows. On the one hand it has already been noted from (24.5a) 
and (24.7) that yz and hence pz transforms like the function z, 
which in the case of the ozone molecule (point-group 6m2) means 
the representation A”,. On the other hand pz, is a property of the 
equilibrium configuration of the molecule, and must therefore be 
left invariant under the group 6m2 (since each operation of the 
group simply moves the nuclei around into a new set of positions 
which is physically equivalent to the original set: this is what is 
meant by the symmetry group of the molecule!) That is, uz9 trans- 
forms according to the identity representation A’,. Now pz cannot 
at the same time transform according to both A”, and A’,, and the 
only way out of this self-contradiction is to have zo zero. Similarly 
zo, Hyo are zero and we conclude that the ozone molecule has no 
permanent dipole moment (as is also intuitively obvious). 

A similar argument can be applied to the second terms of (24.14). 
The suffix zero on the quantity (G,;/2qg)) denotes that it is evaluated 
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at the equilibrium position. It is a numerical quantity which is a 
property of the molecule in its equilibrium configuration. It is 
therefore invariant under the symmetry group, i.e. transforms 
according to the identity representation which in the present case 


is A’,. Thus the term 
7) 
qe (#4) (24.15) 


transforms in the same way as gg. But it also has to transform in 
the same way as yj, and hence (24.15) can be non-zero only af Ys 
transforms tn the same way as 4. In the case of ozone the vibrational 
co-ordinates are 
gq, transforming according to A’,, 
Jz, 73 transforming according to E’, (24.16) 


and pz, Hy, #2 transform according to HE’ and A”, as already noted. 
Hence the q, term is completely absent, but the q,, ¢3 terms occur 
IN fly, fz. _ 

The question now is, what kinds of spectral line does the term 
(24.15) give rise to? It is shown in most elementary books on 
quantum mechanics and also later in this section (see equation 
(24.31)), that for simple harmonic oscillator wave functions the 
matrix elements of gg are zero, except between pairs of states differing 
in energy by one quantum fiw,s. In symbols 


| b* (Mp0) Ie ¥(Ng,)dv = 0 unless 2g. — %gy = +1 (24.17) 
and ny». = 1, for » # B. 
The absorption lines are thus at the fundamental frequencies Wz. 


Combining this with the italicized result above (24.16), we have 
the rule: 


there 1s an infra-red absorption line at the fundamental 
frequency we if the corresponding normal co-ordinate qg 
transforms in the same way as one of the dipole moment 
COMPONENTS fea, by, H2- 


| (24.18) 
This is just the rule stated without proof and illustrated in § 23. 


The theory for Raman lines is completely analogous. We expand 
the polarizability components; 


ote) OPagy 
Oy, = Mg + Oe | Doxj ni Tey 29804,) o (24.19) 
+ higher terms. 
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The first term on the right side determines the intensity of the purely 
rotational Raman spectrum, which only exists if at least one of 
the components oj (or a linear combination thereof) transforms 
according to the identity representation. The next term gives rise 
to the fundamental Raman lines and we have the rule: 


the Raman spectrum contains the fundamental line of 
frequency shift we tf dg transforms in the same way as 


one of the polarizability components o4; (or a linear com- 
bination of them). 


(24.20) 


Overtones and combination tones 


The same type of argument can be applied to determine which 
quadratic and higher terms are present in the expansions (24.14) 
and (24.19). All the derivatives such as (@%u,/0q,0g,) in (24.14) 
remain invariant under symmetry transformations of the molecule, 
and they are non-zero only if the appropriate product like q,q, 
transforms according to the same irreducible representations as 
py does. For instance from (24.16), we have for ozone 


q,2 transforms according to A’, x A’; = A’, 
919» 9193 transforms according to A’; x H’ =H’, = (24.21a) 
qo? + 937 transforms according to A’, 
Ge” — 93, Y29g transforms according to EH’. (24.21b) 


Thus all of the above terms can appear in one or another of the 
oj, but only the terms (24.21a) and (24.21b) can occur in the py; 
expansions. 

Before going on to derive what lines these quadratic and higher 
terms give rise to, it is necessary to recall some results about simple 
harmonic oscillator wave functions which are derived fully in 
appendix G. If we put 


@ = (2hw)-V2(p + twq), a* = (2hw)-'3(p — twg), (24.22) 
then the Hamiltonian for each mode of vibration becomes 
H# = ¥(p? + wg?) = (aa* + Pho. (24.23) 


The eigenvalues of the operator aa* are the positive integers n 
(including zero). The energy levels are therefore 


En = (n + Phew 
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and 7 is the number of energy quanta in the oscillation above the 
“zero-point energy” $iw. The corresponding wave functions can 
be written 


in = apy, (24.24) 


The wave functions in this form are not normalized, but this is 
irrelevant in the present context since we are not interested here 
in calculating absolute intensities of lines. Similarly the vibrational 
Hamiltonian (23.14) 


H vin = 2 $(pg" + we?gp*) (24.25) 
has eigenvalues 
Eng) = 2 (vg + $)hiwg (24.262) 
and eigenfunctions 
tyin(%_) = i (ag)"8 po, (24.26b) 
1 
where fy = exp| — 7 > ots ; (24.27) 
7 


Let us first consider the case of only one fundamental frequency 
w. Suppose symmetry allows a term Ag to be present in the 
expansion of jy or og. From (24.12) we have 


qN = Bia — a*)N = Bia + (a*)N + other terms], 
where B = (—1)*(h/2w)t%, 
so that the transition matrix element (24.8) is 
<2|yj|1> = AB i tn,*[a% + (—a*)% + other terms}yn, dv, 
(24.28) 


where A is some constant. This expression can be evaluated with the 
help of the symmetry properties of the Hamiltonian. (24.23) is 
invariant under the group of transformations T’,, 


(24.29) 


where ¢ is any angle, so that the wave functions %, belong to 
definite representations of this group. Actually (24.29) is a rather 
unphysical transformation which cannot be applied easily to yp. 
However, when we use the wave functions (24.24), %) always turns 
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up in matrix elements in the combination ¥*, which is invariant. 
It is therefore irrelevant how 1, itself transforms, and we arbitrarily 
suppose that y, is invariant under 7. Hence from (24.24), yn 
transforms according to 


xT) = exp(ind). (24.30) 


If we take only the first term in the square bracket of (24.28), we 
have that the matrix element transforms according to 


x(Tg) = exp i(—m, + N + m)¢. 


From the result (13.8c) of the matrix element theorem of § 13, this 
part of the matrix element is non-zero only if x(7',) is the identity 


representation, i.e. if 
Ny — nm, = N. (24.31a) 


Similarly the second term in (24.28) gives a contribution to the 


matrix element if 
Ne — tm == —N (24.31b) 


Physically therefore, (24.28) always allows transitions in which NV 
quanta are absorbed or emitted. These N-quanta transitions are 
referred to as overtones of the fundamental line of frequency we. 
If N > 1, (24.28) contains other terms leading to further lines 
which we shall mention again below. For VN = 1 we just obtain the 
selection rule (24.17). More generally, «of 


qa°d,°Qet ... with N=b+c+d+... (24.32) 


transforms in the same way as one of the ps or a4, then this term occurs 
with non-zero coefficient in the expansion of ps or ay, and all the 
N-quantum transitions 


E, — E, 


a = stbwg + cy + dog +... (24.33) 


have a non-zero intensity. Such transitions are called combination 
tones. The choice of + signs throughout (24.33) derives from the 
fact that (24.32) always contains equal powers of a, and ag*, etc., 
like in (24.28). This has the consequence that a summation tone 
wg + w, and the difference tone wg — w, are either both present 
or both absent from a spectrum. It also means that in a Raman 
spectrum the lines are symmetrically placed about the exciting 


t This follows from the fact that 4, is non-degenerate and therefore trans- 
forms according to a one-dimensional representation, y(7') = exp (ix) say. For a 
one-dimensional representation we have x*(7) x(7) = x(T-") x(T) = x(£) = 1. 
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frequency w, the absorption and emission of a given energy differ- 
ence |#, — E,| being either both possible or both forbidden processes. 

We now mention two complications. Firstly the expansion (24.28) 
contains other terms besides a and (a*)%, for instance the term 
aN-1q* which by the same argument is seen to give an (N — 2)- 
quantum transition. If the corresponding frequency wg, is non- 
degenerate, it can easily be shownt that this (NV — 2)-quantum 
transition would already have been found in analysing the term 
qi’-*. In the case of degenerate frequencies however, it is possible 
to obtain new lines this way. The second complication arises only 
with doubly or triply degenerate frequencies. Consider the term 
9a°d,- From (28.33) this gives the transitions 


E, — E, = (2wg + w,)h, (2m, — w,)h, 
(—2w, + w,)h, (—2wg — w,)h, 


in general all obviously 3-quantum transitions. However, if ws = w,, 
these transitions become 


E, =. E, = Shwe, hwa, —hw,, —3hwe, 


and the middle two appear to be one-quantum transition. How- 
ever, they have only the intensity of a 3-quantum process (see 
below), if the genuine fundamental line is not allowed. 


Relative intensities 


We shall now calculate the order of magnitude of the relative 
intensity of an N-quantum transition (28.33). The usual Hamil- 
tonian for a simple harmonic oscillator is 


1 

— 72 a 272 

an? tagmed> 

and comparison with (23.14) shows that our gg are more or less 
Ig = mi/2 x (displacement of atoms) 


where m is the mass of the vibrating atoms. Thus the matrix 
elements of the term (28.32) are of order 


(m28)N 


t If gg transforms according to a real one-dimensional representation, then 
ge? transforms according to the identity representation. Thus gg and a — 
transform in the same way, and are either both present or both absent from 
the expansions (24.14) and (24.19). 
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where 6 is the distance through which the atoms vibrate. Also 


ON; - lp 
Ogg? 09,° Oa...) (m¥2R)N 


where £ is the internuclear distance. Thus the intensity of the lines 
is proportional to 


2 §\2N 
|etut>| re wl*(3) 


We can estimate 6 as follows. Classically, when the atoms are at the 
end of their swing, all the energy is in the form of potential energy 


and 
E = (n+ $)hw = $mw?d?. 


Taking typical values n= 5, hw = 200cm—!, m=165 proton 
masses, R = 10-8cm, we find 6/R ~ 0-1, so that successive sets of 
N-quantum transitions differ in intensity by a factor of about 
(5/2)? = 1/100. We shall see below that the presence of anhar- 
monicity increases the intensities of some lines rather, but it never- 
theless remains true that transitions with high values of NV become 
increasingly weaker. For example, in the observed spectrum of 
CO, as listed by Herzberg (1945, p. 274), the line with highest NV 
is a 7-quantum transition. 

All these intensities have to be multiplied, of course, by the fraction 
f, of molecules actually in the state %yip,1 initially, and hence able 
to make the particular transition. If the sample is in equilibrium 
at an absolute temperature 7’, then 


f, = (const) exp(—#,/&7’). (24.34) 


Molecular vibrations have an energy of about 2000 cm-1, whereas 
at room temperature k7’ =200cm-!. Thus transitions from 
excited states will tend to be very weak. This is seen very clearly 
in the Raman effect, where the lines on the high frequency side of 
the incident radiation (anti-Stokes lines) are much weaker than 
those on the low frequency side (Stokes lines), the intensity ratio 
of corresponding lines being 
£,— E, 
exp — “a 


Anharmonicity 


We shali now examine one of the simplifying assumptions that 
has been made throughout § 23 and the present section so far. In 
§ 23, we started with an arbitrary potential V(R,) governing the 
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nuclear motions, and expanded this about the equilibrium con- 
figuration as a power series (23.6) in the Q,. Assuming that the 
amplitude 6 of a nuclear vibration is very small compared with the 
internuclear distance R, we retain in H yin (23.9) only the quadratic 
terms, and this then led to-the simple harmonic oscillators Hamil- 
tonian (23.13) or (24.25). However, the assumption is not strictly 
valid because we showed above that 5/R ~ 0-1. The true situation 
is that the vibrations are sufficiently small for (24.25) to be a very 
good first approximation to the Hamiltonian, but that the real 
Hamiltonian contains anharmonic perturbing terms such as 


9eI8° + Jey9e"y + fetes +... (24.35) 


which have small but observable effects. These effects are of two 
kinds, (i) an increase in the intensity of some lines, and (ii) a splitting 
of some degenerate levels and lines. 

As regards intensities, the presence of perturbing anharmonic 
terms in the Hamiltonian means that the eigenstates will not have 
the simple harmonic oscillator form (24.26), and those arguments 
which depend on this specific form have to be re-examined. For 
instance the selection rule (24.17) no longer holds, and the term 
Qe in the expansions of py; (24.14) and o4 (24.19) can give rise to 
various lines in addition to the fundamental frequency wg. These 
other lines will have an intensity proportional to the amount of 
anharmonicity in the potential. This can be seen explicitly by 
considering the ggq,* term in (24.35). We restrict ourselves to the 
case of a single frequency w and drop the suffix 8. From ordinary 
perturbation theory and the selection rules (24.31) for the matrix 
elements ¢2|gq?{1> of the perturbation, we have that the perturbation 
mixes wave functions %, with 4, = +3 (Schiff 1949, p. 151). 
The perturbed wave functions are 


Ppert() = Cniln + Cn+s¥ni3 + other terms 
Ppert(n + 4) = dasrapnra + dnyi¥ni1 + other terms, 


where Cy dais 1, Cni3 ~dniy ~g, and where the wave 
functions on the right-hand side are the unperturbed ones (24.24). 
If the expansion of y; contains the term Ag, then we have 


i] Ppert(” + 4)ujppert(n) dv 
~ Aenisdn is | Whiatnss de + Aendn ys | Phin do oc g, 


so that the linear term Aq in the expansion of 4; leads to a 4-quantum 
transition with intensity proportional to g?(6/R)?. In this kind of 
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way the allowed N-quantum transitions with high N usually have 
their intensities increased considerably above that expected from the 
unperturbed simple harmonic oscillator calculation. 

Some of the energy levels (24.26a) of the simple harmonic oscillator 
Hamiltonian are highly degenerate, and the anharmonic perturba- 
tion (24.35) produces a splitting in it. Consider a pair of degenerate 
frequencies wg = w, = w say. The energy 


E(n) — (ng - g)hwe a (n, ot thw, = (2 a ny, aa 1 hw 
(24.36a) 


depends only on the total number of quanta n = ng + 2n,. The 
level is therefore (n + 1)-fold degenerate,} corresponding to the pairs 
of values of ng and n,, 


Ng=n,n—I1,n—2,...,1,0; 
n, =0,1,2,...,n—1,n. (24.36b) 


For the point-groups, the irreducible representations are at most 
3-dimensional, so that for n greater than 2 or 3, the wave functions 
of the level must form the basis of a reducible representation. This 
will be split by the perturbation (24.35) into irreducible components 
in accordance with the general theory of § 6. The splittings are 
usually up to about 50cm~1!. For example in ozone the n = 2 
level of the degenerate frequencies we = w, (24.16) is three-fold 
degenerate in the simple harmonic oscillator approximation, the 
wave functions being 


Ay7 bo, Agh go, Ag *hq. (24.37) 


These transform according to the representation A’, + EH’ of the 
symmetry group 6m2 (see below), and the anharmonic perturbation 
therefore splits the level into a non-degenerate one (A’,) and a 
doubly degenerate one (H’). The spectral lines are split corre- 
spondingly. 


Symmetry of the wave functions 


In order to study the splittings of the energy levels by anhar- 
monicity in detail, it is necessary to know the transformation 
properties of the wave functions (24.26b). From (24.27), %, remains 
invariant under the symmetry group of the molecule just like the 


ft Although this degeneracy is obvious by inspection, it is an interesting 
exercise to derive it group theoretically. (See problems 24.9 and 24.10.) 
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potential energy 4 > w,7q,7 does. We also havet pg = Gg (23.12), 
so that pg, transforms in the same way as gg, and hence so do ag 
and a,* (24.22). If we write the representation of gg and ag as D‘), 
we have that the state (ng) (24.26b) with energy > (ng + 4)hiwg 
transforms according to the product representation 


D = D(ng) x D(n,) x ...; (24.38) 


where D(ng) = D® x D® x Di) x ... to ng factors. 

For a real one-dimensional representation, we have that D‘®) 
x D) is the identity representation,t so that if wg is a non- 
degenerate frequency 


D(ng) = D” if ng = odd, 
= identity representation if ng = even. 


The reduction of D(ng) is rather more complicated for degenerate 
frequencies, wz = w, = w say, D'\*) — D™, The degenerate states 
(24.36) now transform according to 


D(n) = D® x D® x D® x... ton = ng + n, factors. 
(symmetrical product) (24.39) 


Here the words “symmetrical product’’ denote that this representa- 
tion is not quite the same as what is usually meant by a product 
representation. We first illustrate this by taking the case n = 2 
as an example. Suppose that we have four quantities a, a, and 
bs, 6, each pair transforming according to the two-dimensional 
representation D‘®), There are in general 2 x 2 = 4 second degree 
products between them, 


App, Agb,, a,Dz, a,b, 


the important point being that these reduce to only three independent 
ones when we put ad, = bz, a, =b, because then a,b, and a,b, 
become identical. Specializing to the case of the degenerate fre- 
quency (24.16) of ozone, we have the four linear combinations 


(@gb2 + ab); (Gab, — 3b, debs ++ @gbe); (d_b3 — @gbq); 
(24.40a) 


+ This result is directly connected with the form (23.14) of the Hamiltonian, 
and thus assumes implicitly that the gg are all real. It is therefore essential 
in this connection to regard a pair of one-dimensional complex conjugate 
representations as a single real two-dimensional one, 


t There are three simple ways of seeing this: from the footnote below 
equation’ 24.29), from problem 23.3, and from (14.13) or problem 13.13. 
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transforming according to 
EH xXxH=A',+E£E'+A’,. (24.40b) 


This is the reduction of the ordinary product representation E' x E’ 
However, to obtain our wave functions (24.37) we want to put 
a, = bz, G3 = b,, in which case the last term in (24.40a) becomes 
identically zero and the representation A”, drops out of (24.40b). 
We have what is called the symmetrical product 


E' x E’ (sym) = A’, +E’. (24.40c) 


More generally if a set of base vectors ¢; transform according to 
D, the products ¢;; are said to transform according to the sym- 
metrical product D x D (sym). This is equal to the ordinary 
product D x D with some of the irreducible components dropped 
out, because ¢i¢; and ¢;¢; are not independent quantities (see 
problem 9.4). The same applies to repeated products D x D x D 
x ... (sym). Clearly in the case of degenerate frequencies it is the 
symmetrical product (24.39) that we have to use in (24.38) to obtain 
the total representation. 

From (14.3) the characters of the ordinary n" product are 
{x(Z")]", where the (7) are the characters of D‘®, but from what 
has been said above this cannot be applied to the symmetric product 
D(n) and we have to develop a new formula for the characters of 
D(n) which we denote by x'")(7). For a doubly-degenerate frequency, 
the matrix D,;‘°(7) can always be reduced by an equivalence 
transformation (5.15) to the diagonal form 


b ‘ 

: ce} 
Then x(7') = b +c. Let a, and a, be the base vectors corresponding 
to this form of Dy‘*)(T). With these base vectors the other matrices 


of the representation will not be diagonal, but it is sufficient that a 
pair of such base vectors exists for each transformation 7’. We have 


Tag => bag, Ta, = a, 
Tag = bag. Ta, = cha, 
T(ag"*a,") — bYTc?(ag"—"a,"). 


The base vectors of the representation D(n) are the eigenfunctions 


Aghby, ag" ta, rho, Ag” "a,b, ove a,™o 
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of the degenerate energy level (24.36). The characters of D(n) 
are therefore 


xT) = be + b*-Ie + bM-%c2 +... om 
= $[(b + c)(6*-? + b"-%e +... + 0%-7) + (6" + €%)] 
= Hx(T)x* (TL) + x(T)). (24.42) 
This recurrence relation serves to determine the characters y'”)(7') 
of the symmetrical product representation D(n). The total repre- 
sentation D (24.38) can then be built up. For the sake of com- 
pleteness we quote an alternative formula for x'"(7') (Wilson et 
al. 1955, p. 355): 
if x(7) = 0 and x(Z?) = 2, then 
x'™(T) = 0 for n odd, 
= 1 for n even; 


i 1) 
otherwise OUT) = anes (24.43) 
T 
where cos 67 = 4y(T); 


in the limiting cases of 67 = 0 or 7, 


x'2(T) = n + 1 for Or = 0, 
= (—1)"*1(n + 1) for Op =z. 


Analogous formulae exist for three-fold degenerate frequencies 
(problems 24.5 and 24.6). 


Summary 


The intensities of the lines in the infra-red absorption spectrum 
and the Raman spectrum are largely determined by the matrix 
elements 


| Pyib,2 M4 Pyin dv and Pyin,2 oy Pyini dv (24.44) 


of the dipole moment py; and the polarizability tensor « of the mole- 
cule. The yz; and ay are expanded as power series (24.14) and (24.19), 
only certain terms being allowed in the expansions on account of 
symmetry. The allowed terms give one immediately the correspond- 
ing fundamental frequencies, overtones and combination tones 
(24.33) which will be present in the spectra. One also obtains a 
rough lower limit estimate of the intensity. Such an analysis is 
very quick and helpful in arriving at a first tentative interpretation 
of a newly observed spectrum. If, however, one tries to derive the 
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complete and detailed form of the spectrum in this way, one runs 
into the following difficulties: (i) the “other terms” in (24.28) give 
rise to lines in addition to those we have considered; (ii) with 
degenerate frequencies some of the N-quanta transitions degenerate 
into ones of apparent lower order; (iii) the anharmonicity of the 
potential splits the lines quite appreciably and upsets the calculation 
of the expected intensities. 

Having made a preliminary identification of the spectrum, it is 
therefore better to draw up a complete table of the allowed energy 
levels with their associated irreducible representations, taking 
into account the splittings of the levels due to anharmonicity. 
One can then determine systematically all the allowed transitions 
between these levels from the rigorous selection rules (24.9) and 
(24.13), which are derived directly from (24.44). This is not too 
arduous a task because the temperature dependent factor (24.34) 
in the intensity usually allows one to restrict one’s attention to just 
the few lowest lying states for the initial state yin. It should be 
emphasized that this procedure is rigorously exact. Each energy 
level has a definite irreducible representation of the symmetry 
group of the molecule associated with it. Although these repre- 
sentations are calculated with the help of the simple harmonic 
oscillator wave functions, they remain unaffected by the anharmonic 
part of the potential which only affects their energies and thus 
produces splittings (see §6). On the other hand the transformation 
T', (24.29) is a purely mathematical device, unrelated to the sym- 
metry of the molecule, and applies only to the simple harmonic 

oscillator Hamiltonian. Thus the simple harmonic oscillator selec- 
tion rules and degeneracies derived from 7, do not apply rigorously 
any more when the anharmonic part (24.35) of the potential is taken 
into account. 


References 

Wilson, Decius and Cross (1955) discuss the theory of vibrational 
spectra in detail, and Herzberg (1945) tabulates and interprets the 
actual spectra of a large number of molecules. 


Epilogue 

In the last two sections and earlier in this book, we have often 
used the ozone molecule for illustrative purpose, assuming that the 
oxygen atoms form an equilateral triangle. An analysis such as 
described above then predicts a vibrational spectrum which is 
different from the observed one, so that the assumption cannot 
be correct (Herzberg 1945, p. 286). Electron diffraction studies and 
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an analysis of the binding energy of the molecule also support this 
conclusion. The correct configuration appears to be an obtuse- 
angled isosceles triangle. 


PROBLEMS 


24.1 What strong two-quanta transitions do you expect to find 
in the infra-red and Raman spectra of some of the molecules of 
problems 23.1 and 23.2? 

24.2 A certain molecule with point-group symmetry com has 
three low lying vibrational energy levels, the associated irreducible 
representations being A,, HZ, and H,. What infra-red and Raman 
transitions are allowed between these levels, and between them 
and the ground state? 

24.3 The methane molecule has a triply degenerate frequency 
corresponding to the normal co-ordinates 91, g2, ¢; which transform 
according to the representation 7', of the point-group 43m. Deter- 
mine the degeneracy of the n = 2 level and how it is split by the 
anharmonic part of the potential. What infra-red and Raman 
transitions are allowed between these states and the ground state 
of the molecule? 

24.4 The vibrational frequencies of the carbon dioxide molecule 
are about 1350 cm-! (Aj), 667 cm~! (H,,) and 2349 cm! (Ag) 
(see equation (23.24)). Taking the ground state as the zero of 
energy, make a list of the approximate energies of the first few 
excited states. List also all the transitions Z, — EH, that are allowed 

in the infra-red and Raman spectra with #,< 10,000 cm—! and £, 
<1000 cm=1, and are less than 4-quanta transitions. Check this 
against the list of observed frequencies (Herzberg 1945, p. 274). 

24.5 Derive for the three-fold degenerate case the expression 

analogous to (24.42): 


X(T) = H2x(T)x'*-Y(T) + 
+ 4 fx(P?) — Ex(T)3x'9-2(7) + x(T)). 
24.6* Derive (24.43) and the analogous result for the three- 
fold degenerate case (Wilson ef al. 1955, p. 352). 
24.7 D is a real representation of some group. Show that the 
symmetrical product D x D (sym) contains the identity representa- 


tion once. 
24.8 Show using the method of § 9, that 


Di) x DN (sym) = > DY) 
with J =%, % —2,% —4,...,2,0, 
18 
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where D‘J) is a representation of the full rotation group, and j is an 
integer. What is the corresponding result when j is half an odd 
integer? 

24.9 The Hamiltonian # for a doubly-degenerate simple 
harmonic oscillator is invariant under (24.29) and under ag > ag 
cos §— a, sin 6, a, >a, sin 6+ a, cos 8. Hence show that the 
symmetry group of # is isomorphic with the full rotation group 
(§ 8), and derive the degeneracy of the higher energy levels. Com- 
pare with problem 24.10 and with the method of Wilson e¢ al. 
(1955, p. 352). 

24.10 Deduce from the symmetry of the Hamiltonian that 
the energy levels of a two-dimensional isotropic harmonic oscillator 
are (n + 1)-fold degenerate, where » may be any positive integer 
or zero (Baker 1956). Use the fact that. the Hamiltonian in the form 
hw(a,*a, + a,*a,-+1) is invariant under the group of two- 
dimensional unitary matrices operating on a, @, and this group 
is almost isomorphic with the full rotation group (problems 8.13, 
8.15 and 8.17). 

24.11* Extend the results of the previous problem to an n- 
dimensional isotropic harmonic oscillator (Baker 1956). Can this 
approach be used to calculate the splitting of the levels by anhar- 
monic terms in the potential? 

24.12* Raman scatiering intensity. Give a quantum mechanical 
derivation of the scattered Raman intensity formula (24.10). 

Notes: the author has been unable to find an elementary modern 
derivation of this, but had no difficulty in obtaining one along the 
following lines. Consider the whole system of electrons and nuclei 
and electromagnetic radiation interacting with one another. The 
transition probability to a new state in which one photon w, is 
absorbed and we is emitted is given by second order perturbation 
theory (equation (29.20) of Schiff 1955). It is necessary to consider 
two types of intermediate state, one in which w, is emitted after 
w, is absorbed and one in which ws is emitted first. The one-photon 
matrix elements are given by equations (50.9) and (50.13) of Schiff 
1955. To reduce the resultant expression to the desired form, it 
is necessary to use some identities of Dirac (1927) and Schiff (1955, 
equation (35.20)). The assumptions that the ground state is elec- 
tronically non-degenerate and that the vibrational quantum is 
small, plus the use of time-reversal symmetry, allow one to cast the 
result into the form of (24.10) (Placzek 1934). To obtain the con- 
nection with the polarizability a, it is necessary to obtain a, by 
calculating the dipole moment induced by a time-dependent electric 
field @ cos af. 


Chapter VI 
SOLID STATE PHYSICS 


25. Brillouin Zone Theory of Simple Structures 


The translation group and the Brillouin zone 


In this and the next section we shall develop the irreducible 
representations of the symmetry groups of crystalline solids, with 
applications to the motion of an electron in a perfect crystal. Such 
a group is called a space group © and includes both translational and 
rotational symmetries. The distinguishing feature of a crystal is, of 
course, that it consists of a regular periodic array of identical cells. 
Thus fundamental to any space group is the translational group T 
consisting of all translations t, of the form 


| ty = M8, + MA, + Nga. | (25.1) 


Here a,, a, and a, are three primitive translation vectors (not coplanar) 
along the crystal axes, and ,, 2, n, are any set of integers. For 
instance the face centred cubic lattice (Fig. 27) is obtained by taking 


a, = ta(iy + i), a, = ga(iz + iz), a, = $a(iz + iy), 
(25.2) 


where a is the cube length and iz, iy, i, unit vectors along the edges. 


(b) 


Fig. 27. (a) Primitive translation vectors and primitive unit cell 

of the face centred cubic structure. (b) Primitive translation vectors 

of the body centred cubic structure. The primitive cell is obtained 
by completing the rhombohedron. 
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Now a finite crystal is not really invariant under any of the ty, 
(25.1). Whereas t, leaves the middle of the crystal invariant where 
the structure is really periodic, it displaces all the sides of the 
crystal to new positions. To make use of the translational symmetry 
therefore, we have to modify our problem so that it has € as an 
exact symmetry group. We do this by one of two artifacts. Take 
as a sample a crystal of size N,a, by N,a, by N,a, where N,, Nz, 
N, are large numbers. We either consider an infinite number of 
identical such samples stacked together or we imagine opposite 
faces of the crystal to be made mathematically to join up in the 
same sense as we can join up the two ends of a straight line by 
bending it into a circle. In either case we regard the three translations 
N,a,, Na, Na, as affecting no change at all and we put them 
equal to the identity element, i.e. to zero translation 


ty,00 = ton,o = toon, = tooo- (25.3) 


Expressed in terms of the wave function , this procedure gives the 
usual periodic boundary conditions 


vr) = P(r + Nya.) = o(r + Nya.) = oe + Nga;). (25.4) 


The artifact does not, clearly, correspond to anything physical and 
occasionally it has to be used with care, e.g. in discussing any 
surface effects. 

The translations (25.1) with the restriction (25.3) now form a 
finite group of N,N,N, elements. The group is Abelian, so that 
all the irreducible representations are one-dimensional (appendix 
E) and can be written down by the method of §7. They are labelled 
by a wave vector k such thatt 


t, is represented by exp(tk: ty), (25.5a) 
or trv (r) = wy (r + tn) = exp(tk *tn)X (1), (25.5b) 


where y,(r) transforms according to the irreducible representation 
k. A function which has the property (25.5b) is called a Bloch 
function, and (25.5b) is taken to define its k vector. 

From (14.14), the group T has only N,N,N, distinct representations, 
and we obtain these by putting two types of restriction on k. The 


ft No confusion should arise from using tn both as a translation operator as 
in try(r) and as an ordinary vector in exp(iK-t,). 
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first is obtained by noting that there exists in k-space a whee 
lattice of vectors 


Km = m,K, + m,K, + m,K,, | (25.6) 


called the reciprocal lattice, each satisfying the condition | 
exp(?K -t,) = 1 for all ty. | (25.7) - 


This is easily shown along the lines of problem 25.2. It follows 
that k is not uniquely defined by the property (25.5), since k and 
k + Ky give identical representations 


exp tk-t, = exp i (k + Km)-tn for all ty. . (25.8) 


To label the representations uniquely, it is therefore necessary to 
restrict k to some region which is a unit cell of the reciprocal lattice. 
The particular unit cell which has been adopted by convention is 
obtained by bisecting the lines joining K = 0 to the nearest reciprocal 
lattice points. It is called the Brillouin zone and from now on we 
shall assume without further mention that the k’s which we use 
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Fie. 28. Reciprocal lattice of the square lattice. The dots are the 

reciprocal lattice points K,. A BC D is the conventional Brillouin 

zone, but any other unit cell of the reciprocal lattice such as E F G H 

could in principle be used instead. All the points marked x denote 

the same irreducible representation in the sense of (25.8) as the point 
k inside the Brillouin zone. 
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to denote representations always lie in the Brillouin zone. With this 
convention k is often called the reduced wave vector and the 
Brillouin zone the reduced zone. Fig. 28 illustrates the situation 
for the two-dimensional square lattice, which we shall use for 


(b ) 


Fig. 29. Brillouin zones of (a) face centred cubic structure, and 
(b) body centred cubic structure, with special points shown. 


illustrative purposes throughout this section although all our 
arguments apply equally to one, two and three dimensions. Fig. 
29 shows the Brillouin zones of the body centred and face centred 
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structures. The second restriction on k comes from applying 
(25.3) or (25.4), and we obtain 


a7 wa ws 
k= aK +a & + Ks (25.9) 


where rj, 72, 73 are integers. This gives correctly the total number of 
N, N, N, different irreducible representations in the Brillouin zone 
see problem 25.4), 


Fie. 30. Brillouin zone of the square lattice, showing some points 
related by reciprocal lattice vectors. 


In this scheme, there is still some ambiguity about k’s on the 
surface of the zone. Since the Brillouin zone is a unit cell of the 
reciprocal lattice, opposite faces are always separated by a reciprocal 
lattice vector. For example in Fig. 30 


kr —= kg K,, 


so that ky and kg denote the same representation and we shall use 
either one of them indiscriminately. Likewise there is one re- 
presentation which can be referred to by either k4, kg, kc or kp 
(Fig. 30). 


Other symmetry elements 


As well as the translation group @, a space group © in general 
contains purely rotational symmetry elements R and combined 
rotational-translational elements {R|t}. Among the latter, t is 
not necessarily restricted to the lattice translations (25.1): if the 
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crystal symmetry involves screw axes or glide planes, then t may 
be a fractional lattice translation (§ 26). In any case a translation 
t does not alter the k vector of a function,t 


tok(r + t) = &(F + t + te) = exp(tk : ta)yy(r + t). 


However, the rotation R transforms a function ,(r) with wave vector 
k into a new function yy(r) with wave vector k’, where k’ is derived 
from k by the rotation R applied in k-space. This result which is 
intuitively obvious, follows from the fact that in (25.5) the scalar 
product k-+t, remains invariant if we apply the same rotation of 
axes in both k-space and real space. 

The question now arises, is the concept of a k vector any use in 
sorting out the irreducible representations of ©, in view of all the 
rotational symmetry elements? Consider some vector space R 
transforming according to an irreducible representation of ©. 
Since G contains the translation group € as a subgroup, X is 
invariant under € and we can reduce it with respect to € to give a 
set of base vectors} #(k,; r), ¥(k,; r), . . . with definite k vectors. 
Thus 


in setting up the irreducible representations of any space 


group &, we can restrict ourselves to base vectors which 
are Bloch functions, 


(25.10) 


i.e. functions each of which has the property (25.5b) with some 
definite k vector. . 

Consider now the energy of an electron moving in a crystal. 
The Hamiltonian is 


—(h2/2m)V? + V(r) (25.11) 


where the crystal potential V(r) has the symmetry of the crystal. 
| From the result (25.10) we may use the Bloch functions 4,(r) as 
our basic energy eigenstates, the energy H being given by some func- 
tion H(k) varying continuously with k through the Brillouin zone. 
E(k) is normally a multivalued function of k, the branches being 
numbered one, two, three, . . . in order of increasing energy, starting 
arbitrarily with the lowest one of interest, e.g. in metallic sodium 


t Note that in general yy (r + t) = exp(tk-t)ya(r) is only true if t is a lattice 
translation t,,. 


t To avoid multiple suffices, we shall on occasions write ¥(r) as ¢(k; r), 
or by way of abbreviation as (k). 
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with the one relating to the valence electron. Because of the discrete 
nature of the allowed k vectors (25.9), a branch of £(k) really 
represents a band of closely spaced energy levels, and the form of 
E(k) throughout all bands is called the band structure of the crystal 
as illustrated in Fig. 31. 


E(k) 
] th band H 
] th bond I 
H th band 1 
! 
t 
rd band 
i 2nd bond | 
' ’ 
1 Ist be ' 
i t 
| 
-7 oO 4 hy 


Fig. 31. A typical band structure H(k) with several bands, plotted 
along the kz axis. 


Let 4(k,; r) be an energy eigenstate with k, at a general point 
in the Brillouin zone (Fig. 32a). Then the various rotations will 
generate from this other functions ¢(k,; r), %(k,; r),..., and from 
the fundamental theory of § 6 we have that these are also energy 
eigenfunctions with 


E(k,) = E(k,) = E(K,) =... (25.12) 


Thus E(k) has the full point-group symmetry of the crystal, and this 
conclusion is independent of whether the axes are pure rotation 
axes or screw axes. This theorem allows us to investigate the form 
of #(k) near the surface of the Brillouin zone. Let us consider one 
such band, which we shall assume has no degeneracy with any 
other band. In Fig. 32a, we have EH(k,) = E(k,) by rotational 


+ The branches of H(k) are sometimes referred to as the “first zone’, 
“second zone”, etc., because of the way they can be remapped in the extended 
zone scheme (problem 25.11). For our purposes the region of k-space, namely 
the Brillouin zone, is the same for all of them: it is just that there are different 
states with different energies having the same irreducible representation k. 
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symmetry. Now k’, and k”, are removed from k, by reciprocal 
lattice vectors K, and —K,, and they may be used instead of k, 
to designate the states. In fact the choice of the Brillouin zone as 
our fundamental cell in k-space was rather arbitrary, and we could 


-T ky 0 a a 

{b) 
Fia. 32. Symmetry of the band structure. (a) Brillouin zone of the 
square lattice with points related by rotational symmetry. (b) H(k) 


plotted along the line k, k, k’, of Fig. 32a, with a maximum at the 
zone boundary kz = +7/a.— 


equally well have chosen a cell displaced sideways somewhat to 
include k’, and to exclude k,. Thus we can conveniently extend 
E(k) as a continuous periodic function in the whole of k-space with 


E(k,) = E(k',) = B(k",) =... ete. 


as shown in Fig. 32b. By choosing k, near enough to the Brillouin 
zone boundary, we see from the relation H(k,) = H(k’,) that H(k) 
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has a minimum or maximum as it crosses the zone surface. L[.e. 
in the case of the square lattice, the normal component of grad,Z 
is zero on the zone boundary. 


General representations of a space group © 


At this stage we shall restrict ourselves to space groups involving 
ordinary rotations but without screw axes and glide planes, and 
return to the general case in the next section. In symbols, all the 
symmetry elements have the form {R|t,}. We have already 
shown (25.10) that in setting up the irreducible representations of & 
we can choose as our base vectors Bloch functions 4,(r). These 
are all invariant under t, (apart from the factor exp ik: t,) so 
that we need only consider the effect of the rotations R which by 
themselves form a point-group Pp. 

Let %(k,) be a base vector of some irreducible representation of 
©, where k, is at some general point in the Brillouin zone (Fig. 
32a). We shall construct from this the rest of the representation. 
By operating on ¥(k,) with all the rotations R; of P (where R, is 
the identity element F), we generate other functions ¥(k;). These 
k; are all different for, by assumption, k, is at a general point in the 
zone without any rotational symmetry properties. They form a 
set of vectors having the symmetry of the whole point-group like 
the stereograms of Fig. 9 or the set k, to kg in Fig. 32a. The first 
thing to prove is that these ¥(k;)’s really form a representation. 
Let Rj rotate the vector k; into kj. Then R; can be written 


Ry = FR, 
and hence 
Ryf(ky) = RrReMp(ky) = Reply) = Yk). (28.1) 
Thus each R; transforms any ¢(k;) into some other ¥(k;), so that the 
functions 
¥(K,), ¥(K,), .. . (Ke), . . « (25.14) 


definitely give a representation of G. Moreover the representation 
is irreducible. For since each k; occurs only once in the set (25.14), 
any way we might try to reduce the representation would involve 
taking linear combinations of functions with different k’s. This 
we have already proved in (25.10) that we need never do, and we 
conclude the representation cannot be reduced. 


Special points in the zone 


The above argument fails for special points in the Brillouin zone 
because two or more of the k; become identical. The k; may become 
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identical either (i) by virtue of lying on a rotation axis or mirror 
plane, or (ii) by being separated by a reciprocal lattice vector from 
one another (see Fig. 30), or (iii) by (i) and (ii) combined. The six 
kinds of special point for the square lattice are shown in Fig. 33a, 
ir, 4 and 2 belonging to type (i), Z belonging to type (ii), and X 
and M to type (iii). 


(a) (b) 
Fic. 33. Brillouin zone of the square lattice, showing special points. 
(a) The six types of special point. (b) The star of 4. 


We first consider 4 as a representative of the class (i) type of point. 
The point-group symmetry P for the square lattice ist 4mm (see 
§ 16). Let 4(k,) be a Bloch function with k, at 4 (Fig. 33b), belong- 
ing to some irreducible representation of ©. The elements of PD can 
be divided into two classes which leave k, unaltered or change the 
k of the function respectively, as follows: 


class I, k, unaltered:—E, my; 
class II, k changed:—4,, 22, 423, mz, ma, ma’. 


The transformations of class IL produce functions with k = k,, 
k,, k,, and these together with k, make up what is called the star 
of 4 as shown in Fig. 33b. Now the rotations of class I satisfy 
each of the group properties of § 4 because they all leave k, invariant. 
This can be verified in detail. Alternatively it follows from the gen- 
eral result of § 4 that symmetry properties always form groups. This 


t In two dimensions, 4mm is the same as 422. It should be noted in what 
follows that mz denotes reflection in a line (or plane) perpendicular to the 
x-axis. Suffixes d and d’ denote the diagonals of the square. Rotations of 
90°, 180°, 270° about the z-axis are denoted by 42, 22, 423. 
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small point-group which leaves k, invariant is called the group of 
k, or the group of 4, and is denoted by XH. In the present case it is 
the point-group m with two representations, the symmetric and the 
antisymmetric one which we shall call 4, and 4,. 

We can now take one of these irreducible representations 4, 
or A, of K, and build it up into an irreducible representation of &. 
We first choose a (k,) that is symmetric or antisymmetric under 
my (4, or 4,). Then using the elements of class II (not leaving k, 
invariant), we generate functions with k = k,, k, and k,. Since 
ma = 4,my and since my is an element of KH, the two operations 
ma and 4, produce the same function: likewise m; and 2, do, and 
also mg and 4,°. Thus we obtain four functions $(k,) to #(k,), 
and in the same way as for the general representation (25.14), it 
follows that they always transform into one another and give an 
irreducible representation of ©. Moreover the representation is 
completely specified by the representation A, or 4, of K that we 
started with. This procedure is quite general for obtaining an 
irreducible representation of the whole space group & from an 
irreducible representation of H. The matrices look like 


D ; 
D’ 


where D, D’, D", etc., are matrices of the irreducible representation 
of K. However, for most practical purposes it is sufficient to discuss 
the irreducible representations of H plus the overall full symmetry 
of E(k) without ever writing out the full representation. 

In the same way we obtain the irreducible representations of 6 
associated with the point I, k = 0 (Fig. 33a). In this case the star 
of I" consists of the single point k = 0. H, the group of I, is the 
point-group 4mm, whose irreducible representations can be looked 
up in appendix K. They are shown in Table 24 numbered arbitrarily 
I, to I, as is the fashion in solid state physics, and these symbols 
are also used for the corresponding representations of 6. The 
situation for special points on the zone boundary (types (ii) and (iii) 
above) is exactly the same. One only has to remember that corre- 
sponding points on opposite sides of the zone have to be regarded 
as the same k vector. For example the group of X (Fig. 33a) includes 
the transformations 2, and mz, even though these turn the point 
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(z/a, 0) into (—7/a, 0). Thus the group of X is the pomt-group 
mm with the irreducible representations shown in Table 24. Simi- 
larly with M, all the zone corners correspond to the same k vector 
which may be taken as any one of the four vectors (+7/a, +7/a). 
The group of M is 4mm, the same as that of I’, so that the irreducible 
representations for the two points are shown together on the same 
table (Table 24). Those of 2 and Z are likewise shown together 
with those of 4. 


TABLE 24 
Irreducible Representations at Special Points 


r, M E 22 42, 4,8 Ny, Mz Ma, Ma’, 
r,, M, 1 1 1 1 1 
T,, M, 1 1 1 =1 =I 
r;, M; 1 1 —1l 1 —1 
Ty My 1 1 = Sel 1 
r;, M; 2 —2 0 0 0 

wc A ee 
x E 22 Mz my 
be 1 1 1 1 
X, 1 1 -1 ai 
x, 1 =) 1 ~1 
X, 1 = =i 1 
A Ez My 
Zz E ma 
Z E Mz 
A, a1, Z, 1 1 
A;, 22, Z, 1 aa 


Energy level splittings in a crystal 


Our precise specification of the irreducible representations of G 
allow us to make applications to the splittings of energy levels and 
degeneracies between bands, to which we now proceed. Let us 
approximate to the eigenfunctions of the Hamiltonian (25.11) by 
plane waves,{ and consider in particular the four 


exp t(n/a)(x + 9), exp t(m/a)(% — y), 
exp i(nja)(—x +y), exp im/a)(—x — y). (25.15) 
+ Or better by orthogonalized plane waves (Herring 1940). 
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These all have the same reduced k vector, which may be taken 
arbitrarily as any one of the four (+7/a, +7/a), say k = (x/a, 
aja). That is, they belong to the point M in the Brillouin zone 
(Fig. 33a). The four plane waves (25.15) are equivalent to one 
another by rotational symmetry and have the same energy. Thus 
in this approximation, we have four consecutive bands degenerate 
with one another at M. However, the four functions (25.15) form 
a representation which is reducible into the components M,, M, 
and M, as shown in Table 25, so that the presence of the potential 


TABLE 25 
Representation Formed by (25.15) 


Complete E 22 42 Mz Ma 
representation 4 0 0 0 2 
Irreducible M, 1 1 1 1 1 
components M, 1 1 —l —l 1 
M; 2 ~—2 0 0 0 


V(r) in (25.11) will split the level into two singlet levels and a 
doubly degenerate one. The particular order of the levels depends 
on V(r) and can only be established by detailed calculation (problem 
25.13) but the M, two-fold degeneracy will remain in any approxi- 
mation. 


Compatibility relations 

In any one band certain compatibility relations must be satisfied 
between the irreducible representations of states along the lines 
specified by 4, X' and Z on the one hand, and the irreducible re- 
presentations of I’, X and M at the corners on the other hand (Fig. 
33a). This is because X for instance is both a special point in its 
own right, and yet also a point on the lines 4 and Z. Consider for 
example the wave function {(4,; r) belonging to the irreducible 
representation 4,: it changes sign under the reflection m,. Now 
in any one band #(k; r) varies continuously} with k, so that as 
k = 4 approaches X, the function %(4,; r) must turn into a function 
which is still odd under my, i.e. into a function belonging to either 
the representation X, or X, since these are the only ones odd under 
my as can be- seen from the character table (Table 24). All such 
compatibility relations for the square lattice are given in Table 26. 


f Except possibly at points of accidental degeneracy (see below). 
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The case of the two-dimensional representations I’; and M, 
is slightly more complicated. Taking I’; for instance, we have two 
degenerate wave functions ¥,([,; r) and ¥,(2,; r) with k = 0. 


TABLE 26 
Compatibility Relations 
Representation Compatible with . 

A, Pit Pe ok, 
4, Poilp fe Xe Xe 
ay I, Vy Is M;, Ms, M,. 
2 I, I's, I's; Me, Ms, M,. 
Z, X,, Xs; M,, M;, M,. 
Ze Xe, Xy M,, M,, M;. 


I, reduces into 4, + 4, or 2, + 2, 
M, reduces into 2, + 2, or Z, + Zs, 


As we go along the line 4 (Fig. 33a), the only symmetry elements 
remaining are F and my, and under this more restricted group the 
representation I", becomes reducible into 4, + 4. Thus according 
to the general principles of § 6, the doubly-degenerate level at I’, 
splits into two singlets as we go in the direction 4. The same applies 
to the line , so that we have two bands degenerate at the single 
point k = 0. | . 


Degeneracy 

There are three different basic types of degeneracy between 
bands. The first type, which we have already encountered, is 
exemplified by the representations I, and M,. These are two- 
dimensional, so that we have a degeneracy between two bands 
at a definite point in the Brillouin zone. This type is called an 
essential degeneracy because it is due to symmetry properties and 
cannot be altered or split by changing the crystal potential V(r) 
slightly. Both of the other two types of degeneracy depend sen- 
sitively on V(r) as regards the exact value of k at which they occur, 
and are called accidental degeneracies. 

The second type of degeneracy is between two tequivalent 
representations and is illustrated by Fig. 34. Suppose we find by 
direct calculation from the Hamiltonian (25.11) that the lowest 
two levels at I’ and X have the representations I,, I, and X,, X; 
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as shown. From the compatibility relations (Table 26), we obtain 
the representations at 4 between [and X. We note that J, turns 
into 4, which cannot change into X, but must join up with X,. 
Similarly I, turns into 4, which joins on to X,. Thus the curve 
T,4,X, must cross I',4,X, at some point P, and we obtain a degen- 
eracy between the two bands at P. Incidentally it is important 
that the bands should be numbered strictly in accordance with 


r 4 x 


Fie. 34. Representations and H(k) along the line PX, showing 

degeneracy between two inequivalent representations 4, and 4, at 

P. The dotted curves show how E(k) would vary if we took a curved 
path from I to X that by-passes P slightly. 


increasing energy. Thus in Fig. 34, [4,P4,X, belongs to the 
first band and I',4,P4,X, to the second. The necessity for this 
can be seen by considering a path from I" to X which is slightly 
deformed from the straight line [4X so as to avoid the point P: 
in this case the energy varies as shown by the dotted line with 
yy, varying very rapidly with k near the point P, and we see that 
I, and X, must clearly belong to the same band. 

The third type of degeneracy is between equivalent representations, 
in particular at general points in the Brillouin zone where there is 
only the.one type of representation. This sort of degeneracy can 
occur along a line of degeneracy in three-dimensional crystals with 
a centre of symmetry, but not over a whole surface in k-space. 
For an outline of the proof of this result, see problem 26.8, from 
which the general method of tackling such a problem in any number 
of dimensions or in any special case should be clear. In two dimen- 
sions with a centre of symmetry, only points and not lines of degen- 
eracy are allowed. The whole subject of accidental degeneracy has 
been fully discussed by Herring (1937b). The third type seems to 
be rarely met with in practice, though the first and second types 
are common. 
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Summary and references 


Various effects of crystal symmetry have been described, and in 
particular the irreducible representations of space groups not con- 
taining screw axes or glide planes. The translational symmetry of 
wave functions gives a k vector, which can be used to label the 
base vectors of a representation under all conditions. The wave 
vector k conventionally varies over a restricted region of k-space 
called the Brillouin zone. Most irreducible representations, i.e. 
those involving general k vectors, are of dimension n, where 7 is 
the number of elements in the point-group symmetry of the struc- 
ture. The situation is more complicated for k vectors lying on 
symmetry axes, mirror planes or the surface of the Brillouin zone. 
The irreducible representations have been used to discuss energy 
level splittings and degeneracies (Herring 1937b) between different 
bands of the energy function Z(k) of an electron travelling in the 
solid. The case of the two-dimensional square lattice has been 
treated in detail. Analogous results for the simple cubic, face 
centred cubic, and body centred cubic structures have been given 
by Bouckaert, Smoluchowski and Wigner (1936). Further refer- 
ences are given at the end of § 26. 


PROBLEMS 


25.1 Verify that the 1 by 1 matrices (25.5a) really give a repre- 
sentation of T. 

25.2 The reciprocal lattice. Prove that the condition (25.7) 
defines a lattice of vectors (25.6). Hint: in the notation of (25.1), 
show that 


27a, /\\ as 2Qra, / a, 2a, \ a, 
K, = ao K, — A eae | = A Z 
a, A 89°a, a, (A 4,'a, a, A a,°a, 


satisfy the relation 
Ky > ay = 2784. 


Put K = m™,K, + m,K, + m,K, and show that m,, mm., m, are 
integers. 

25.3 Verify that the vectors (25.2) form a set of primitive 
vectors for the face centred cubic structure, and find a set of primitive 
vectors for the body centred cubic and the simple cubic structures 
(Fig. 27). Derive and describe their reciprocal lattices and Brillouin 
zones. 

25.4 Show from (25.3) that k is restricted to the points (25.9). 
Calculate the density of points in k-space from (25.9) and the volume 
of the Brillouin zone, and verify that there are N,N,N, distinct 
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representations, i.e. one per primitive unit cell in the crystal. Thus, 
counting spin degeneracy, the Brillouin zone contains two electron 
states per primitive unit cell of the crystal. It should be noted that 
in this statement the cell referred to must be the primitive one, 
i.e. not the cube of Fig. 27a but the cell defined by (25.2). Further- 
more it is irrelevant how many atoms there are in each primitive 
unit cell: while this number is one for body centred and face centred 
cubic metals, it is two for the hexagonal close packed metals, for 
the diamond, tin and bismuth structures, and is higher still for more 
complicated structures such as most salts and molecular crystals. 

25.5 Use the idea of translational invariance to discuss the 
eigenstates of a particle moving freely under no forces. 

25.6 Test whether the following functions transform irreducibly 
under the translation group of the square lattice (length of square 
=a), and if so what are their reduced k vectors: exp(}iz/a), 
exp 1(n/a)(—$x — 3x), cos(}rn/a), sin(yx/a), exp t(m/a)(x + y), 
exp i(a/a)(x — y), cos(xm/a) exp(iym/a), cos(8x7/a), cos a/a. 

25.7 Show that a Bloch function (25.5b) can be written in the 
form 

wy (r) = exp(tk - r) u(r) 


where %(r) is periodic in the lattice. Hence show that in general 
(25.5b) holds only for lattice translations t, (25.1) and not for 
arbitrary translations t. 

25.8 The energy “#(k) in a particular band is a single-valued 
function of k inside and on the surface of the Brillouin zone of the 
face centred cubic structure (Fig. 29a) i.e. there is no degeneracy 
with other bands. Show that the normal component of grad,# 
is zero over the whole of the square faces of the zone, but that in 
general it is zero on the hexagonal faces only on the lines joining 
opposite corners of the hexagon. 

25.9 Show that 


> exp tk : th = NNN 350, 
nN 
>, exp tk + ty = NiN2N35n0, 
k 


where the summation over k is over all allowed k’s (25.9) in the 
Brillouin zone, and where Sy9 is the Kroenecker delta symbol 
(A.15) applied to each component kz, ky, k, of k. Note that these 
relations are just the orthogonality relations (14.8) and (14.18). 
25.10 A function ¢ = a%(k,) + 6y(k,) belongs to an irreducible 
representation of a space group 6. Show that ¢(k,) and ¢(k,) 
belong to this same irreducible representation and hence may be 
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used as base vectors for the representation. This is an alternative 
proof of (25.10). Hint: let t, be a lattice translation such that 
exp ik, : ty ~ exp ik,: ty. Consider ¢ and tnd, and show 


[exp(tk, - tn) — tn]d 


op(E,) = S5GK,* ta) — expGk, « t9)" 


25.11 The free electron model and the extended zone scheme. 
Consider a square lattice but with the potential V(r) in (25.11) 
set equal to zero. The energy eigenfunctions are then exp(tk’ - r) 
with energy E(k’) = 4?(k’)?/2m. With the aid of Fig. 35, redescribe 
E(k’) by a multivalued function Z(k) with k in the Brillouin zone, 


Fic. 35. Diagrams for problem 25.11. (a) The extended zone scheme 
in k’-space, (b) remapped into four Brillouin zones. 


and draw the energy contours of H(k) in the first four zones. Note: 
when the wave functions are rather like plane waves, it is for many 
purposes convenient to denote them by an unrestricted actual 
wave vector k’ rather than the reduced wave vector k. Then 
E(k’) is a single-valued function of k’, with energy discontinuities 
at the lines of Fig. 35a. This is called the extended zone scheme. 

25.12 Write down the group of k and the irreducible representa- 
tions of the face centred cubic structure associated with the point 
X (Fig. 29a). 

25.13 By inspection or using (14.11), write down the linear 
combinations of the plane waves (25.15) which transform respectively 
according to M,, M,, and M,. Calculate their energies in the nearly 
free electron approximation, i.e. treating V(r) in (25.11) by first 
order Setacbston theory. Note: in actual metals the potential 
V(r) is so strong as to make nodes in the wave functions near the 
atomic nuclei, and this may alter the order of levels from that 
found in the nearly free electron approximation. 
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25.14 Consider the functions 


bm = 2 dm(r — Rn) 


where the dm, m = 2,1, 0, —1, —2 are atomic 3d functions and the 
R, are the atomic sites in a face centred cubic metal, e.g. copper. 
What representation of the space group do the %m form, and to what 
extent will they be degenerate in the solid? (In an isolated atom 
all five $m are, of course, degenerate.) 

25.15* Derive the energy levels and irreducible representations 
at all special points for the first two bands of the body centred 
cubic structure, using the nearly free electron approximation (see 
problem 25.13; Mott and Jones 1936, p. 63; Bouchert et al. 1936). 
Compare with the results of a detailed calculation for potassium 
by Callaway (1956). 

25.16* Discuss the energy E(k) of Bloch functions made up 
out of overlapping p functions in a simple cubic structure using the 
tight binding method in the nearest neighbour approximation 
(Mott and Jones 1936, p. 70). Derive the irreducible representa- 
tions at all special points in the Brillouin zone, discuss all degeneracies 
and state which degeneracies (if any) would disappear on taking a 
better approximation (Bouckaert et al. 1936). 

25.17 Verify the compatibility relations of Table 26. 

25.18 Perturbation theory for E(k). If the Bloch function y,(r) 
is written in the form of problem 25.7, show that u(r) satisfies 


[—(h?/2m)V? — (th?/m)k-V + V(r) + (A7k?/2m) py (r) = E(k)uy(r). 


Hence, discuss how H(k) at a point k =k, + &k near k, can be 
investigated using the perturbation operator P where 


P = (—th?/m)V-5k + (h?/m)k, Sk + (#?/2m)(8k)*. 
Verify that 
[ wk, n; v)Pu(k,, n’; r) dv 
= (h?/2m)(8k)"8nn' + (h/m)dk- i $*(k,, 2) (—thV) p(k, 2’) de, 
where » and n’ refer to the different bands Z,(k). Using this tech- 


nique, prove that grad, H = 0 at the point X in the square lattice 
(Fig. 33a), and show that one of the effective mass parameters 


he i? 
aBloky? "WY ~ otB/aky? 


is likely to be small if the energy gap to the next zone is small. 


Mea = 
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25.19* Use the irreducible representations of space groups to 
describe the elastic lattice vibrations of some simple solids such as 
the face centred cubic structure (Peierls 1955, Houston 1948, see 
also § 23 and Herring 1954). Derive selection rules for the Raman 
spectrum and show that in first order it should consist of lines 
rather than bands (Peierls 1955, Stephen 1958). Discuss in this 
light the ideas of Raman on this subject (see Bhagavantam et al. 
1948, Menzies 1953). 

25.20 Show that the space group of the square lattice is not 
a “direct product” of the translation group € and the point-group 
4mm in the sense of § 15, and verify that the representation theory 
of § 15 does not apply. 

25.21 In aluminium (face centred cubic), a detailed calculation 
shows that the irreducible representations in the first and second 
bands are probably as follows: first band I,, X’,, W;; second band 
I’’5;, X1, W, (Heine 1957b). The order of levels at K is uncertain 
but assume that it is K, (band 1) and K, (band 2). Derive the 
compatibility relations at these points and show that there must 
be a line of degeneracy between the second and third bands in the 
plane kz = 0. (For the character tables and notation, see Bouckaert 
et al. 1936. See also problem 26.8.) 


26. Further Aspects of Brillouin Zone Theory 


In the latter half of the last section we limited ourselves to space 
groups which do not contain screw axes or glide planes. In this 
section we shall remove this restriction, and then go on to consider 
the effects of time-reversal and of spin-orbit coupling. Finally the 
theory will be applied to a qualitative description of H(k) near the 
top of the valence band in germanium and indium antimonide. 


Screw axes and glide planes: bands sticking together 


In § 25 we wrote an arbitrary element of a space-group in the 
form 
{Rt} (26.1) 


where t is a translation, applied first, and & is a proper or improper 
rotation following. All the possible rotations that are consistent 
with crystal symmetry have already been derived in § 16. The 
translations t include all the lattice translations ty, (25.1), and in 
§ 25 when we started discussing the irreducible representations in 
detail we limited ourselves to space-groups in which t is always 
one of these lattice translations. However, there are many space- 
groups for which this is not so, e.g. that of the diamond structure 
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and the hexagonal-close-packed structure. These contain screw 
axes and glide planes, i.e. symmetry elements which are rotations 
or reflections respectively combined with a translation by a fraction 
of a lattice displacement. For example a two-fold screw axis is a 
translation by half a lattice displacement, say }$a,, with a 180° 
rotation about the axis a,. If we apply this screw axis twice, we 
obtain a total rotation by 360° = 0° and a displacement of 2 x 4a). 
This is a pure translation, and therefore it has to be a lattice trans- 
lation t, (as of course it is in our example). This condition limits 
the kind of screw axes and glide planes that one can have. It can 
also be shown that the displacement in the case of a screw is parallel 
to the axis of rotation, and in the case of a glide plane is parallel 
to the reflection plane (problem 26.2). 


E(k) 


6th zone 


5thzone 


4th zone 


2nd zone 


ist zone 


oO 


Fic. 36. Energy band structure showing bands “sticking together” 
in pairs at the zone surface. 


The irreducible representations of the more general space-groups 
with screws and glides are derived by methods analogous to those 
of § 25. The discussion there is completely general as regards the 
definition of Brillouin zones and the reciprocal lattice, and with very 
little extension the arguments there give all the irreducible re- 
presentations of k vectors not lying on the surface of the Brillouin 
zone (problems 26.4 and 26.5). However, when it comes to the 
surface points of the Brillouin zone a new phenomenon can arise, 
described as “bands sticking together’. It can happen that some 
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special points, usually along a whole line on the surface of the 
Brillouin zone or sometimes over a whole zone face, have only 
two-dimensional irreducible representations. Along such a line or 
face we always have two bands degenerate, and #'(k) looks something 
like Fig. 36 with the bands going in pairs. It might at first sight 
appear convenient just to choose a new Brillouin zone of double 
the volume of the usual one, and to describe each pair of bands in 
Fig. 36 as a single band in the new zone scheme: but Hund (1936) 
has shown that this leads to more ambiguities and inconveniences 
that it removes, and we shall not consider the idea further. 
Following Hund (1936), we can set up a necessary condition for 
the sticking together of bands to occur. Consider the ls states in 
an array of atoms with total symmetry equal to some space-group 
© all atoms being of the same element. These 1s states will be 


N N, 
N “XN 
N N 
x \ 
e ~ e \ e ® 
“\ \ 
N NX 
N \ 
* \ 
ay N 
» N 
~ x 
‘. 
Ne x 
\ N 
e a e Se e 
Se NX 
\ \ 
XN ‘ 
\ » 
NN 
g, re bee 
> N 
“ ~ 
° \ e ‘ 
2. a 
x 
2 \. _ 


Fic. 37. Glide planes gg in a simple square array with glide 4a, 

— 4a,. This type of glide plane does not produce sticking together 

of bands because it is an inherent property of the translational 
lattice. 


separated by several electron volts from other states like 2s or 2p, 
but they will be broadened very slightly into a narrow band of 
energies due to overlap between neighbouring atoms (Mott and Jones 
1936, p. 68). There must be two ls states per atom including spin 
degeneracy, and the Brillouin zone contains two states per unit cell 
of the crystal (problem 25.4). Thus to have a pair of bands stuck 
together, we must have at least two ls bands of nearly the same 
energy and that means at least two atoms per unit cell. Moreover, 
we saw in § 25 that a space-group with elements only of the form 
{R|tn} produces occasional degeneracy between bands but not a 
systematic sticking together in pairs, since each character table 
such as Table 24 always contains the one-dimensional identity 
representation. We therefore have to fall back on screws and 
glides to produce the two atoms per unit cell. Furthermore the 
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glide plane must not be an inherent property of the primitive 
translation lattice such as in Fig. 37, for this does not give two 
atoms per unit cell either. Thus a necessary condition for the sticking 
together of bands is that the space-group contains screw axes and/or 
glide planes which are not inherent in the primitive translational 
lattice. It is not known whether the condition is also a sufficient one, 
though the rather general considerations of (26.12) and problem 
26.12 suggest that it may very well be so. 

We now consider some structure with the space group illustrated 
in Fig. 38a, and before going into any detailed discussion of the 
irreducible representations we shall show in the most direct way 


m 
w/e we oe 
aw ae 
wie we we 
ae 
g— —{*_ fhe 
Dp ae ae 


Fie. 38. (a) Pattern with the two-dimensional symmetry Pbl 
(Buerger 1942), showing a glide plane g, a mirror plane m, a centre 
of inversion O, and a unit cell of size a x b. Note that the commas 
are not supposed to represent atoms, but just to form a pattern 
which exhibits the total symmetry as clearly as possible. It is irrele- 
vant to the discussion where and how many atoms there are as long 
as they exhibit this symmetry. In two dimensions a glide plane and 
a two-fold screw axis become identical. (b) The Brillouin zone for 
the above space group. 


possible that the presence of the glide plane produces a sticking 
together of bands at the point X = (z/a, 0) in the Brillouin zone 
(Fig. 38b). Actually the sticking together occurs along the whole 
side Z, ky = m/a, of the Brillouin zone, but this will only become 
apparent when we consider time-reversal symmetry. Consider now 
a function 

w(x, y) with k = (n/a, 0). (26.2a) 


Then the following functions must also have the same k and must 
be degenerate with y: 


gb = o(x + fa, —y), (26.2b) 
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where g, IT, m are the glide, inversion and mirror reflection of Fig. 
38. We have 


gb = o(% + fa, —y) = myp(—a, —y) = mIlp (26.3) 


It is now easy to prove that all the four functions (26.2) cannot be 
proportional to one another. For let us suppose on the contrary 
that they are: then since [/?+ = 4 = m*, we must have 


Ttb=ap, a=+1, and mb=fs, B=+Il. (26.4) 
Now from (26.3) and (26.4) we have 
gs = mIImITb = amlImdy, etc. = «28%, 
and from (26.2b) 


gb = px +a,y) = exp(*Kea)p(x, y) = —¥. (26.5) 


Since «78? cannot equal —1 we have a contradiction, and it follows 
that the functions (26.2) cannot be proportional to one another. 
Every irreducible representation associated with the point X in 
the Brillouin zone is therefore at least two-dimensional, so that we 
have sticking together of bands there. 


Irreducible representations at surface points 


To show how one sets up the irreducible representations of a 
space-group with screws and glides for points on the Brillouin zone 
surface, we will now work these out in detail for the point X (Fig. 
38b) already considered. As in § 25, we first pick out K, the group 
of k, consisting of all elements of the space group & which leave k 
invariant. In the case of our point X this includes the whole of 6. 
Now in § 25 we dropped from X all the translations t, because their 
effect on any function ¥, was completely defined by k, and this 
left 1 as a simple point-group of rotations. However, this device 
will not work here because it does not leave us with a group; for 
instance the element g would be left, but g? which from (26.5) is 
the translation t, = (a, 0) would be excluded. Thus we consider 
instead the matrices ZH’, g’, m’, t’, etc. which represent K in some 
arbitrary representation. Now all translations of the form teyen 
= (2na, mb) are represented: by the unit matrix H’, since k = (7/a, 0) 
and exp(ik : teyen) = 1. Likewise the translations [(2n + 1)a, mb] 
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are all represented by —E’. In this way we see that there are 
only the following eight different matrices in any representation of 1 


E’ representing {HZ |teven} 


g 29 {my|t ofr teven} 
if’ 99 {IT |teven} 
m’ ” {Mz \teven } 
=H! ” {Bt + teven } (26.6) 
—g’ % {my|t + t, + teven} 
ae ” {IT|t, + teven} 
—m' ” {mz|t + teven} 


t, = translation (a, 0): t = translation (4a, 0). 
The reflection mz is in the line shown in Fig. 38a. 


These matrices in themselves clearly form a group, but since in our 
group postulates (§ 4) we only have provision for multiplying 
elements and not adding or subtracting them at the same time, 
we must for the moment consider —EH’ as quite an independent 
element form £’ etc., and to show this we shall write #’, g’, IT’, 
m’ instead of —E’, —g’, —II', —m'. The group multiplication table 
is easily obtained from (26.3) and similar relations, for instance we 
have 

GP=h, 9g P=%, GH = (EF), 

(m’)? = (m')*? = EB’ = (I1')? = (I7')?, (26.7) 

m'IT’ = g’, ete. 


In fact inspection shows that the group is isomorphic with the 
point-group 4mm and we obtain the character Table 27. However, 


TaBLE 27 
Character Table for (26.7) 


E! E' q'; G IT’, it m', mn 


DO et et 
-_ 
— 
-_ 
i 
— 
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this character table cannot take account of the relation # — —E’ 
for the reasons already mentioned. In fact some of the irreducible 
representations of Table 27 are definitely incompatible with this 
relation, so that for our purposes we retain only the ones satisfying 
x(Z’) = —x(E’), namely the two-dimensional one, and reject all 
the rest. It should be noted that this additional requirement stems 
originally from the particular k vector which we are considering. 
Thus, finally, we obtain the character Table 28 for the group of X, 
showing as predicted the sticking together of bands in pairs. 


TABLE 28 
Character Table for the Group of X 


Characters 
Irred.| {E|te}  {E|t,+te} — {my|t+te} {I1|te} {mz|te} 
rep. {my|t+t,+te} {I|t,+te}  {mz|t)+te} 
xX, 2 —2 0 0 0 


The notation for the elements follows (26.6). 


Time-reversal symmetry without spin 


In the preceding discussion we have only considered the orbital 
part %(r) of the electron wave functions, and then just added the 
spin functions uw, or u_ at the end (see for instance problem 25.4). 
This two-fold spin degeneracy corresponds to neglecting spin- 
orbit coupling and other spin-dependent effects. In this approxi- 
mation the Schrédinger equation for (r) is a purely real differential 
equation, and if %(r) is an eigenfunction with energy H then, by 
taking the complex conjugate of the whole equation, we see that 
#*(r) must also satisfy the equation with the same energy ZH. The 
transformation ¥(r) to %*(r) is just the time-reversal operation T 
already discussed in § 19, omitting the spin part, and for the present 
we define 


T(x, Y, 2) aaa p* (x, Y, 2). (26.8) 


Since T is not a linear change of co-ordinates in the manner of 
§ 2, we cannot incorporate it into our ordinary group-theoretical 
scheme. Instead we first have to set up all the irreducible represen- 
tations of our group and then test whether or not time-reversal 
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symmetry leads to any additional degeneracy.f If %(r) belongs to 
the irreducible representation D, then Ty transforms according 
to D* which consists of the complex conjugates of all the matrices 
of D. From §19 we have the following three cases. 


Effect of time-reversal symmetry without spin: 


(a) D can be transformed to real form: there is no 
additional degeneracy. 


(b) D and D* are inequivalent: there is an additional 


doubling of the degeneracy, and D and D* always | (26.9) 
occur together as a pair. 
(c) D and D* are equivalent but cannot be trans- 
formed to real form: there is an additional doubling 
of the degeneracy, and D always occurs twice. 


Wigner has given a test (19.22) that shows which case a given 
representation belongs to, but it is not very convenient to apply 
to space-group representations as it stands, because it involves a 
summation over all elements of the group. However, Herring 
(1937a) has simplified the test for a space-group & to the following 
form. 


case (a), 
case (b), (26.10) 


case (c), 


Here Q, is an element of & which turns k into —k, and n is the 
number of such elements. Thus Q,? leaves k invariant and is an 
element of KH, the group of k. The y in (26.10) refers to the character 
of Qo’ in an irreducible representation of K (not of 6). Furthermore 
for space-groups containing the inversion IT, if IT is an element of 
K then the Qy are just the elements of K; if IT is an element of & but 
not of H, then the Qy are the elements II x K. The test is therefore 
very easy to apply once we have the character table for H. There 
is obviously no need to sum explicitly over all elements represented 
by the same matrices, as long as we take one of each type. For 


¢ Clearly from the general considerations of § 6, an additional symmetry 
element can never split an existing degeneracy and at most only produce 
new ones. 
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example for the representation X, (Table 29), we choose one value 
for teyen, Say teven = 0, and use the following eight Q,: 


Q = {#0} {E\t} — {mylr} {my |r + th} 


Qo? = {E|0} {E|2t,} {H|3t,} {£0} 
x(Qo2) = 2 2 —2 —2 

Q. = {I1|0} {II|t}  {mz|0} {ma |t,} 

Q.2 = {B|0} {E|0} {B|0} {B03 
x(Qo?) = 2 2 2 2 


> x(Qo2) = 8, the representation belongs to case (a). 


The test shows we have case (a), and there is no additional degeneracy 
due to time-reversal symmetry. 

By way of a further example, we shall consider the more general 
point Z,k = (x/a, ky), of Fig. 38. The character table for the group 
of Z is derived as for the point X and is shown in Table 29. Herring’s 


TABLE 29 
Character Table for the Group of Z 


Trred. Characters 

rep. {E|0} — {Blt,}—fma|0}—{oma|t,} —{B] tn} 
2, I —l 1 —1 exp tk + ty 
Zs 1 —1 —1 1 exp ik - tn 


Only typical elements are given, the others being easily derivable. The 
notation follows (26.6). 


Time-reversal test 

Q=Ax K {M0} {ty a,|t} (my|e + ty} 

Qo? {E|0} {HO} {Blt} {B|t,} 

Note that in forming Q,?, we have {II|tn}? = {#|0} for any tn, so that it is 

sufficient to consider t, = 0. 

2 x(Qo*) = 0 for both Z, and Zs, i.e. each belongs to case (b) and there is an 
additional degeneracy due to time-reversal. 


test is also applied in the table and shows that each representation 
belongs to case (b), so that time-reversal symmetry gives an extra 
degeneracy and we have pairs of bands sticking together along the 
whole side kz =7/a of the Brillouin zone. This sticking together 
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of bands can also be proved by a simple ad hoc argument. Let 
#(z, y) have wave vector k = (w/a, ky). Taking the complex con- 
jugate of the defining relation (25.5b), we see that 


Ty, = $*(x, y) has wave vector —k (26.11) 


ie, in the present case (7/a, —ky). Thus g7' has wave vector k 
again, and let us suppose that 


gT dé = xf, (« = some constant). (26.12) 


We obtain the two relations 


gT9Ty} = p(x + a, y) = —Y(z, y), 
gTgT' = gT (ap) = atgT ys = a*onp. 


Now «*« cannot possibly equal —1 so that we have a contradiction, 
(26.12) must be false, and g7'y and 4 are two independent functions. 
However, since g and 7 are both symmetry elements, the two 
functions must have the same energy and we have a sticking together 
of bands at Z. 


The effect of spin-orbit coupling 


So far we have neglected spin-dependent terms in the Hamil- 
tonian, written our wave functions in the form 


2% = d(r)u, and d(r)u_, (26.13) 


and considered the symmetry operations as transforming the 
orbital variables and orbital wave function ¢(r) only. Now when 
the Hamiltonian # contains spin-orbit coupling (§ 11), % is no 
longer invariant under orbital transformations alone, but only 
under the crystal symmetry transformations applied to both the 
orbital and the spin variables simultaneously. The symmetry 
group © therefore remains the same as before. However, when we 
apply it to wave functions like (26.13), transforming the spin as 
well as the orbital part, we need some new irreducible representations 
to describe the effect on the wave functions. As discussed in detail 
for the point-groups in § 16, this is because u, and u_ behave 
peculiarly: instead of remaining invariant under a 360° rotation 
like an ordinary function f(r), they change sign. Let us consider 
the point [(k = 0) in the Brillouin zone of the square lattice (Fig. 
33a). The new irreducible spin representations of the group of 
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are shown in Table 30, as given by Koster (1957) who derived them 
by the methods of § 16. In the table the element 2,, etc., is a rotation 
by 180° about the z-axis followed by a change of sign corresponding 
to an additional 360° rotation. 


TABLE 30 


Spin Representations of the Group of I for the Simple 
Square Lattice (Fig. 33a) 


Characters 
Irred. E EB 2 Zz 4, q, Mex, Me Ma, ma 
rep. 3. 43 a8 My, My Ma’ s ma’ 
r, 2 2 0 V2 —v2 0 0 
r, 2 2 0 -v2 v2 0 0 


Time-reversal: each representation belongs to case (c) of (26.15) so that time- 
reversal never gives any additional degeneracy. 


Relationship to the ordinary representations (Table 24): 
r, x DO») = 7, r, x DO) = 7, 
ry, x DOM) = FP, myx DOM) = PF, 
rx DWY= P+ 7, 
Here D(/*) is the representation formed by the spin functions u,, u_, i.e. 
I, in the present case. 


We can now determine what additional splittings the spin-orbit 
coupling will produce. Suppose that in the absence of spin-orbit 
coupling, we have an energy level with a set of orbital wave functions 
¢(r) transforming according to one of the representations I, 
4=1 to 5 of Table 24. The total wave functions (26.13) then 
transform according to ' 
yx per (26.14) 


where we have used D“/®) in a symbolic sense to stand for the re- 
presentation formed by u,, u_ in whatever symmetry group we 
happen to be considering. The usual use of group characters (§ 16) 
shows that in the present case D"/*) becomes I, and that the products 
(26.14) reduce as in Table 30. We see that only in the case of I’, 
is the original four-fold degeneracy (double orbital and double 
spin degeneracy) split by the spin-orbit coupling into two-doublet 
levels. 

With spin-dependent wave functions time-reversal symmetry 
takes the more complicated form (19.5a and b). We again have the 
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three cases (a), (b), and (c), but the physical consequences are 
different. Let a set of spin-dependent wave functions % transform 
according to an irreducible representation D. Then we have (§ 19): 


Effect of time reversal symmetry with spin: 
(a) D can be transformed to real form: there is an 
additional degeneracy and D always occurs twice. 


(b) D and D* are inequivalent: there is an additional 
doubling of the degeneracy, and D and D* always 
occur together as a pair. 


(c) Dand D* are equivalent but cannot be transformed 
to real form: there is no additional degeneracy. 


(26.15) 


Note the reversal of the effect in cases (a) and (c) compared with 
the “without spin” situation (26.9). Herring’s test (26.10) applies 
exactly as before since it is a mathematical property of group 
representations, independent of any applications to degeneracy in 
quantum mechanics. 

With the use of these techniques, it is in principle quite straight- 
forward to determine what splitting effect spin-orbit coupling 
will have on a given band structure H(k). The situation for a 
general point k in the Brillouin zone is summarized in Fig. 39. 


EIk) Ew) 
: “ta (b) — (e) 
-k 0 k ~k [e) k —k ce) k 


Fie, 39. Symmetry of H(k) in an arbitrary direction about the 
origin. (a) In the absence of spin-orbit coupling whether or not 
there is a centre of inversion. The double line denotes the two- 
fold spin degeneracy and E(—k) = E(k). (b) With spin-orbit 
coupling and an inversion centre. There is a two-fold degeneracy at 
each k, and #(—k) = E(k). (c) With spin-orbit coupling in a 
structure without an inversion centre. There is no degeneracy at an. 
arbitrary k, but E(—k) = E(k). The latter gives the two-fold 
Kramers degeneracy (§ 19). These results may be derived from 
(26.9), (26.10) and (26.15), or by simple ad hoc arguments (see 
problem 26.6). 


20 
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The band structures of indium antimonide and germanium 


The crystal structure of indium antimonide is shown in Fig. 40. 
Each antimony atom tends to loose one electron to the indium, so 
that we effectively tend to have In~ and Sbt ions. Each of these 
has four valence electrons like carbon, so that like carbon they form 
four directed covalent bonds in tetrahedral directions as shown 
in Fig. 40. (For a discussion of the covalence of carbon see § 22 
and Fig. 16.) We may therefore expect to form the electron wave 


Fia. 40. Structure of indium antimonide and germanium. The 
double lines show the covalent bonds. The black and white circles 
correspond to the two types of atoms in indium antimonide. Both 
black and white circles correspond to germanium atoms in a germa- 
nium crystal. Notice how the atoms 1 to 5 form a spiral, indicating 
e four-fold screw axis in germanium. (From W. Snocx.iey, Electrons 
and Holes in Semiconductors. D. Van Nostrand Company, Inc., 
Princeton, New Jersey. Copyright 1950.) 


functions 4, out of directed valence orbitals of the type (22.7), and 
indeed detailed calculations of the band structure E(k) can be 
made on this basis (Hall 1952). However, we shall limit ourselves 
to a qualitative discussion of the band structure k = 0. It is then 
more convenient for our purposes to recognize that the directed 
valence orbitals are just linear combinations of the atomic orbitals 


¢s = an atomic 5s function, (26.16) 
dx, dy, dz = atomic 5pz, 5py, 5p, functions, 
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and to use these atomic orbitals instead. From these we can form 
the Bloch functions with k = 0 


Penoo(l) = 2 ds(f¥ — Im), 
tepaoll) = 3 pa(t — Tm), 
ty n=l) = 2 gy(l — Tm); (26.17) 
te n=l) = 2 $(r — Tm), 


where the summation is over all atoms rm. So as not to confuse 
the notation, we have not indicated explicitly that the atomic 
orbitals (26.16) are really slightly different on the two types of atom. 

The crystal structure is seen from Fig. 40 to have the face centred 
cubic translational symmetry, and the Brillouin zone is that shown 
in Fig. 29a. Because we have two types of atom arranged in a 
particular way, the rotational symmetry is not the full “cubic” 
point-group but the “tetrahedral” point-group 43m. There are 
no screw-axes or glides when there are the two types of atom. The 
point k = 0 in the Brillouin zone we call I’, and the group of I 
clearly consists of the whole space group. Its character table is 
given in Table 3] (Dresselhaus 1955), every translation being 
represented by the unit matrix because k = 0 (25.5a). It is now 


TABLE 31 
Group of I’ for the Indium Antimonide Structure 


Characters 
Irred. Ordinary Irreducible Representations 
rep E 6 3 Zz ha 
ry 1 1 1 1 1 
Pr, 1 1 1 —1 —1 
Pr; 2 2 —1 0 0 
% 3 —1 0 ~—1 1 
I, 3 —l1 0 1 —l 
Irreducible spin representations _ 
E B 2 Xs 3 xt 3 3 x 4 ‘0 Md, ma 
LF, 2 —2 0 1 —-I V2 —Vf2 0 
ry) 2 2 0 1} -~-l —-¥2 f2 0 
F. 4 —4 0 =| 1 0 0 0 


relationship between ordinary and spin representations 
I, ‘ : r 1 I. 2 I. 3 L, 4 5 
My x Des) : Vs Fr, r; rt rs e+ ry 
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easy to write down the transformation properties of $s, pz, py, 
sz (26.17). Since each atom is turned into another atom of the same 
type by a rotation and since all atoms in (26.17) have the same 
coefficient (k == 0), we need only consider the transformations on 
one atom. By inspection we see that %, transforms according to 
I, and gz, py, 2 according to I',. This gives a singly-degenerate 
and a triply-degenerate level at k = 0 (Fig. 41a). 


E(k) E (k) E(k) 


(a) (b)} (c) 
k r k Tr l 


Fria. 41. Energy near I in In Sb and Ge. (a) Without spin orbit 

coupling in both In Sb and Ge, each band has a two-fold spin degener- 

acy. (b) Splitting of top of the valence band by spin-orbit coupling 

in In Sb. (ec) Splitting by spin-orbit coupling in Ge, with each band 

being doubly degenerate. The magnitude of the splitting has been 
exaggerated. 


Experimental evidence and detailed calculations show that the 
lr, and I’, levels are respectively the lowest and highest of all the 
occupied valence band, as is understandable for the following 
reason. We have already inferred from the observed crystal struc- 
ture that the Bloch functions 4, will be made up as far as possible 
from directed valence orbitals like (22.7). In this way the lowest 
energy will be achieved on the average. However, we have seen 
that at k = 0 symmetry forbids such mixing of y¥, with yz, py, 
wz, and the levels approximate to atomic ones, i.e. the s-level lowest 
and the triply-degenerate p-level highest. 

We now include the effect of spin-orbit coupling. Including 
spin degeneracy, the I, level is six-fold degenerate, and it splits 
into a doublet (I77) and a quadruplet (I's) (see Table 31) as is easily 
shown by the method described in connection with Table 30. This 
is exactly analogous to the splitting of an atomic p level into a 
4 = doublet and j = $ quadruplet. The magnitude of the 
splitting is also of the same order of magnitude as in an atom, 
because the spin-orbit energy (11.8) is proportional to the electron’s 
velocity which is large only when its potential energy is low near 
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the nucleus. The potential here differs only very little in the atom 
and the solid. In indium antimonide the splitting is thought to 
be about 0.9 electron volts, Since the crystal structure does not 
have an inversion centre, #(k) at a general k has to be in accordance 
with Fig. 39, so that the variation of E(k) in an arbitrary direction 
from I is as shown in Fig. 41b. 

Finally we consider the case of germanium. Here the two types 
of atom of Fig. 40 become identical, so that the crystal structure 
has some additional symmetry elements which include four-fold 
screw axes and a centre of inversion. The type of splitting at 
k = 0 can be obtained simply in two ways. The group of I con- 
sists of the whole space-group again, but since k = 0 the transla- 
tional component of all elements including the screws has no effect 
and the group of I’ reduces to the “cubic” point-group m3m. The 
character table can be obtained (appendix L and Koster 1957) 
and the splitting at k = 0 is found to be qualitatively the same as 
in indium antimonide. Alternatively this can be seen by noting 
that the splitting in the less symmetrical indium antimonide is 
the same as that in the more symmetrical free atom with complete 
rotational symmetry. Now additional symmetry can only produce 
extra degeneracy, so that in the present case any intermediate 
symmetry such as the germanium structure must have the same 
splitting as the two more extreme cases. In germanium the spin- 
orbit splitting is 0.29 eV. However, at a general point in the 
Brillouin zone the inversion symmetry of germanium produces a 
degeneracy between some of the spin bands in accordance with Fig. 
39, so that H(k) looks like Fig. 41c. More detailed calculations of the 
shape of H(k) can be made using the perturbation approach of 
problem 25.18, and the three branches of E(k) at the top of the 
valence band correspond to the possibility of “holes” with three 
different masses. For details of such calculations and a comparison 
with experiment the reader is referred to Kane (1956) and the 
references given there. 


Summary 


In this section we considered first the irreducible representations 
of complicated space-groups © containing screw axes and/or glide 
planes. At any interior point of the Brillouin zone the procedure 
is essentially the same as that of § 25 for simple space-groups, as 
can be shown in greater detail in the manner of problem 26.5. For 
points on the surface of the zone, we pick out one k vector from 
the star of k (all k; transformed into one another by 6), and set 
up K (the group of k) consisting of all elements of 6 which leave 
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k invariant. In any representation, many translations will be 
represented by the unit matrix because k lies on the Brillouin zone 
surface. We lump these all together in K and similarly all other 
sets of elements that are represented by the same matrix. This 
leaves us with a much simpler group, usually a point-group if k 
has a high symmetry or a one-dimensional space-group for more 
general points, whose character table can be looked up or easily 
found. A single irreducible representation of G is then obtained 
by taking several sets of functions, as many sets as there are distinct 
k,’s in the star of k, each set having one of the k; and transforming 
irreducibly under Xj. 

It is found that screw axes and glide planes generally lead to 
necessary degeneracy between bands at the surface of the Brillouin 
zone, this being called ‘‘bands sticking together’. 

In the presence of spin-orbit coupling, the same procedure applies 
for finding the irreducible representations, except that we now 
have to use the double-valued spin representations. In either case 
we always have an additional time-reversal symmetry which may 
lead to extra degeneracy, as can be determined by a simple test 
(26.9), (26.10) and (26.15). The theory has been applied to the 
band structure of indium antimonide and germanium at the top 
of the valence band. 


References 


The irreducible representations of simple space-groups were 
obtained by Bouckaert, Smoluchowski and Wigner (1936) following 
the mathematical theory of Seitz (1936). The method was extended 
by Herring (1942) to space-groups with screws and glides, and by 
Elliott (1954) to include spin-representations and spin-orbit split- 
ting. Herring (1937a and b) discussed the effect of time-reversal 
symmetry and the occurrence of accidental degeneracy. A syste- 
matic presentation of the whole subject, together with a general 
review and many references, has been given by Koster (1957). 
Bell (1954) and von der Lage and Bethe (1947) have applied 
symmetry properties to the types of functions used in detailed 
calculations of H(k). Herman (1958) has given a compilation of 
space-groups whose irreducible representations have been tabulated, 
complete up to 1957. 


PROBLEMS 


26.1 A space group contains a purely translational symmetry 
element t. Show that t must have the form of a lattice displacement 
t, (25.1). Hint: investigate the consequences of having t = $a, or 4a. 
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26.2* Derive the different types of screw axes and glide planes 
that are possible in a space group. In particular, why do we always 
take the translational part of a screw parallel to its rotation axis? 
(Zachariasen 1945, Seitz 1935). 

26.3 Make a model of a diamond or germanium crystal, and 
describe all the symmetry elements in its space group. (See Fig. 
40, also Kittel 1956, p. 36, and Herring 1942.) 

26.4 The general point in the zone. Verify in detail that the 
argument in § 25 for the irreducible representation connected with a 
general k vector in the Brillouin zone, can be applied with very 
little modification to space-groups containing screw axes and glide 
planes. (See also problem 26.5.) 

26.5 Irreducible representations at interior points. G is a space- 
group containing screws and glides, and k an interior point of the 
Brillouin zone. K, the group of k, has elements {R,|t} where the 
translation t may not be lattice translations in the case of screws 
and glides. Show that the rotations {R,|0} form a point-group Px. 
Let the matrices D(R,) form an irreducible representation of 
Px, and show that the matrices exp(tk-t)D(R,) form an irreducible 
representation of H. Describe how this is then built up into an 
irreducible representation of 6. : 

26.6 For a space-group containing (not containing) a centre of 
inversion J7, time-reversal symmetry does not give (does give) 
additional degeneracy at a general point in the Brillouin zone under 
the “without spin” conditions. Show this using the test (26.10) and 
also by an ad hoc argument based on the functions % and ITT. 
Hence show that with both kinds of space-group E(k) — E(—k). 

26.7 Show that with a space-group containing the inversion J7 
and in the “without spin” situation, the orbital wave functions 
#(r) can always be so chosen that Tz = yz. 

26.8 Accidental degeneracy. A given crystal contains a centre 
of inversion. Show that it is impossible for two zones to be accident- 
ally degenerate at a whole surface of general points in the Brillouin 
zone, but that it is possible for accidental degeneracy to occur 
along a curve running through the Brillouin zone. (Note that we 
are excluding all special k-vectors from consideration here.) Hints: 
(i) Suppose degeneracy occurs at one point k. Using problem 
25.18, set up the secular equation 


A—E C = 
c+ B—E| 
ror the energy E at a neighbouring point k + Sk. (ii) Using problem 
26.7 show that C can be made real. (iii) In the manner of (22.16), 


0 
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obtain the conditions A — B =0 and C=0 for degeneracy to 
occur at k -++ 8k, and show that these two conditions define a 
direction for 5k. 

26.9 Write down the representations of the whole space-group 
Pol derived from Z, and Z, of Table 29, and verify by inspection 
that they belong to case (b) in agreement with the results of Her- 
rings’s test. 

26.10 Write down the irreducible representation of the two- 
dimensional space-group Pb (Fig. 42) associated with k = (x/a, 
ky), Show directly from first principles that it belongs to case (c) 
and verify with the test (26.10). 


SoBe ee eee ee ee — eae pe ee ine ae ee oe oo 
a ‘és oon | we | se) 
| 
| o* | ~~ | 
e e e 
EQ SE I Ne ee ie oe ae + ee ee pas 
‘“ re H ae | aw i! 
Seren = SSS SS Seen een peter age ms +— 
Pb Pba 


Fia. 42. Patterns exhibiting the two-dimensional space-groups 
Pb and Pba, with cell-size a x b. The broken lines denote the glide 
planes. 


26.11 Discuss completely the sticking together of bands in the 
case of the two-dimensional space-group Pba (Fig. 42). 

26.12 Show that a two-fold screw axis perpendicular to a face 
of a Brillouin zone (in three dimensions) gives a double degeneracy 
‘over the whole of that face (i) by a direct ad hoc argument and 
(ii) using Herring’s test (26.10). This is the only situation in which 
sticking together of bands can occur over a whole face of the Bril- 
louin zone: in all other cases it is along lines on the surface of the 
zone. 

26.13 Derive all the irreducible spin-representations of the space- 
group of Fig. 38. Will spin-orbit coupling destroy the sticking 
together of bands? 

26.14* Derive from a simple model a picture of the band structure 
of graphite, and discuss qualitatively whether any additional 
splittings would be expected on going to a better approximation 
and on including spin-orbit coupling (Nozieres 1958 and references 
given there). 

26.15* Write a review on the use of the perturbation method of 
problem 25.18 (k-p method) in theoretical solid state physics 
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for calculating band structures, effective masses, degeneracies, 
and general theory such as <v>) =f-! grad,H. (Kane 1956, 
Cohen 1959, Herring 1937b). 

26.16* Discuss the effect of crystal symmetry on the thermal 
conductivity at low temperatures (Herring 1954). 

26.17* Discuss the application of group theory to determine 
the number of exciton lines in the absorption spectrum of an insula- 
tor. How does this theory apply to cuprous oxide? (Kittel 1956, 
Overhauser 1956, Gross 1956). 

26.18 The bismuth structure has point-group symmetry 3m 
and rhombohedral translational symmetry with two atoms per 
unit cell at (0, 0, 0) and (u, u, w) where w does not bear any relation 
to the cell size. Describe the Brillouin zone (see Mott and Jones 
1936, Fig. 69 and not 70). It is found experimentally that the 
bottom of the conduction band can be fitted by three degenerate 
ellipsoids 


E(k) = (h?/2m )(aykz® + agky® + agkz® + Lagkyke) 


with all other coefficients zero. Here x and z are chosen along a 
diad and the triad axes, and kz, ky, kz are the co-ordinates of k 
relative to the unknown centre of the ellipsoid. From the form of 
E(k), show where in the Brillouin zone the centres may lie, and 
discuss whether symmetry requires the existence of three or a 
multiple of three such ellipsoids. 

26.19 An X-ray of wave vector k, is incident on a crystal, and 
the direction of the diffracted ray is given by k, where |k,| = [k,]|. 
Show that the diffracted amplitude is proportional to 


| e(r) exp #(ke, — k,)-r do 


where p(r) is the electron density in the crystal and where the 
integral extends over the whole crystal. Hence, show that the 
diffracted intensity is zero unless k, — k, = Kym, a reciprocal 
lattice vector. For this reason an X-ray diffraction photograph 
consists of a number of spots which can be labelled by the integers 
(m,, Ms, Ms) of (25.6). If a crystal has a two-fold screw axis parallel 
to the vector K,, show that the intensity of the spot (m,, 0, 0) is 
zero if m, is an odd integer. This is called an extinction. What 
extinctions would you expect in a cubic crystal containing a glide 
plane perpendicular to K, with glide direction parallel to K,? 
Deduce also the Bragg relation 2d sin 8 = nA from this formalism 
(Kittel 1956). 


i) 
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27. Tensor Properties of Crystals 


Introduction 


In anistropic media like crystals, various properties of matter 
have to be represented as tensors. For instance Ohm’s law takes the 
form 

J=a'€ 


where the conductivity of the material is represented by the second 
order tensor 6 with components 


Gze Tzy Vz 


Now in a material with no symmetry properties such as a triclinic 
crystal, these components are all arbitrary (apart from the physical 
requirement oy = oj). But when we go to the opposite extreme 
of a cubic crystal, the two-fold and four-fold symmetries about the 
x, y, and 2z-axes impose the requirementst 


Cxze = Cyy = O72 — GC say; 


oy = 0,44). 
Thus the conductivity tensor reduces to the form 
a0 0 0 
0 « 0 
0 0 ca 


with three non-zero components of which only one is linearly 
independent. 

Similarly many properties of matter form various types of tensor 
of different orders, for instance the elastic modulus, third order 
elastic constants, magnetoresistances, piezoelectric constants, mag- 
netostriction constants, etc. The problem is to determine what 
forms these tensors must have for crystals belonging to the 32 
different crystal classes described in § 16. In particular we wish to 
know which components are zero, and what are the relations between 
the non-zero ones. A variety of different group theoretical approaches 


t This can be proved for instance as follows. Suppose we apply a field 
& = (&, 0, 0) along the z-axis. The component of the current along the 
y-axis is oyz@. Because of the two fold symmetry about the y-axis, this 
component must remain the same if we apply the same field along the negative 
x-direction, ic. & = (—€, 0, 0). Hence oyz? = —ayzr& and cyz = 0. 
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have been used to discuss this problem. In practice it is probably 
quickest to use a combination of these depending on the circum- 
stances, e.g. whether one wants to discuss a particular point group 
or several related ones, whether one wants to discuss a high order 
tensor ab tntivo or one has available for use the results for lower 
order tensors, and whether one wants the full form of the tensor 
or just the numer of independent constants in it. It is therefore 
not possible to give an exhaustive treatment, but we shall illustrate 
as many of the ideas as possible by an extensive discussion of a 
particular example, namely the elastic constants for a crystal of 
symmetry 422. 


Strain, stress and the elastic constants 


The strain in a body is the amount by which it is locally deformed. 
Let i, j, k be unit vectors along the z-, y- and z-axes, and 


e=ui+oj + wk (27.1) 


be the displacement of an arbitrary point r from its undeformed 
position. The strain at that point is then specified by the six strain 
components 


ou ov Ow 
Cra = a? euy By? Cxz = a? 
1/dv ou 1/dw , o& 
toy = te =5(— + Fe), en ew = 3 (a + Zh: 
l/ou dw 
Cre = Cza = AS i =}: (27.2) 


Many authors omit the factor 4 in their definitions of ezy, eyz, ézz. 
We refer to Kittel (1956, p. 85) for a helpful discussion of the strain 
components in physical terms that can be visualized rather more 
easily than the formal definitions (27.2). 

The stresses in a body are the forces which tend to deform it. 
Consider a square surface of unit area perpendicular to the x-axis, 
and let 

F, = (Fizz, Fry, F'zz) 


be the force acting across this surface. By this we mean that if we 
cut away the material on one side of the surface, we would have to 
apply a force +Fy, to the material on the other side to keep it in 
equilibrium. The + depends on which side we cut away, the + 
sign applying if we cut away the material in the positive x-direction 
from the surface. Similarly we define the other stress components 
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Fyz, Fyy, Fyz, Pex, Fiy, Fiz. We now consider a unit cube of 
material and use the condition for equilibrium that the sum of the 
moments of the forces must vanish. This gives (Fig. 43) that 


Fry = Fyz, Fyz = Fry, Faz = Fz. 


We therefore have six independent stress components Fizz, Fyy, 
Fz, Fry, Fz, Fyz. 


a> Fy 
f hy 
~fy O x 
———_—a 
—Fyx 


Fia. 43. Demonstration that Fz, = Fyz for the total torque about 
the origin to be zero. 


In discussing stress and strain components, it is sometimes con- 
venient to employ an alternative notation. This replaces the double 
sufficies like xy by a single one 1 to 6 as follows: 


1 for xx, 2 for yy, 3 for zz, 

4 for yz and zy, 5 for xz and zz, 6 for xy and yz. (27.3) 
Thus the stress and strain components will be written as-epg, Fipg 
or e, F' where we shall always use p, q, 7, s for suffices taking the 
values x, y, z (or 1, 2, 3) and ¢, 7 for suffices running from 1 to 6. ~ 

In a material obeying Hooke’s law, the stress and strain com- 
ponents are related by the constants cj; known as the elastic con- 
stants or the elastic stiffness constants. 

Fy = cyej, (j summed) 
or Fpq = Cpersers. (7, 8 Summed) (27.4) 

By considering a small cube and the work done on it by all the 
forces F'pg, we obtain that the total work done per unit volume 
in an arbitrary small strain deg is 

8U = Federer + Pyydeyy + FPzz2dez2 + 
+ 2 BF yzdeyz + 2 F 2782 + 2F zySezy- 


- 
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We have 0U/ée, = F, and 0U/ée, = F,, and on further differentiation 


Hence from (27.4) we obtain 
Coy = Cy. (27.5) 


This reduces the number of independent elastic constants from thirty- 
six to twenty-one. 


Transformation properties 


From (27.4) it follows that the cygrs transform under rotations 
as the components of a fourth order tensor, i.e. in the same way 
as the products pgrs. This is an immediate application of a funda- 
mental theorem of tensor analysis (Milne 1948, p. 42), and can 
also be proved directly as follows. Suppose the co-ordinates x, y, 
z transform according to 


Rp' = Dpp'p — (p, p’ = 4, y, 2) 
(p summed) 
under a rotation R. D is a unitary matrix and is real, whence D-) 
= D. It can easily be shown that 2/@x, 2/dy, /éz transform like 
x, y, z, and thus from (27.2) the strain component epg transforms 
like the product pq according to Dpp'Dgq’. Similarly F'pg transforms 
in the same way. Each side of (27.4) therefore transforms according 
to 
Doyp' Dag’ 5r'r" 55's" — Doygrsp'g't' s'Drr’ Des’, 


where Cpgrs transforms according to Dygrsp'g'r's'. Multiplying each 
side by Dye Dang = Dyhuu Dyer and summing over r”, 8", 
we obtain 
Doqrsp'g'r's' = Dpp' Dgq' Drr' Ds’. 
Thus 
Cyars, PATS, Cpgerss (27.6) 


all transform in the same way. 

We now discuss the transformation properties in more detail to 
take into account the fact that the tensors egg, Fryg, cy are all 
symmetric 


Cpg = gps Fpq = Fop, Cy = CH (27.7) 
The components of an arbitrary tensor tpg transform like the nine 
products 142%,, XY, X42, . . . according to 


DY xy DY — D®@ 4+ DM + Yo, 
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The linear combinations that transform like standard base vectors 
are given in (9.4), and we notice that the linear combinations trans- 
forming according to D“) are antisymmetric and form the three 
components of an antisymmetric tensor (which is the same thing 
as a pseudo-vector (Milne 1948)). They therefore cannot apply to 
the epg which form a symmetric tensor, and which transform like 
the other base vectors, i.e. according to D@) + D), The separation 
into symmetric and antisymmetric components is not accidental 
and can be deduced in a more general way as follows. As in § 9, 
we consider the products umvs transforming according to D‘ 
x D® and count up the number of times each particular value of 
M=m-++» occurs. However, this time we do not consider umv, 
and u,0m separately, but count the symmetric sum wmv, + U,0m 
once only. Thus the highest values M = 2j and M=2j—1 
occur once only, M = 2) — 2 twice, etc., and we have 


DP x D (symmetric product) = D@) + D@-® 4... + DO, 
(j = integer) (27.8) 

in contrast to the usual formula (9.2). For a discussion of sym- 

metric products from a slightly different point of view, see equation 


(24.39) et seq. We now see that the symmetric tensor epg trans- 
form according to the symmetric product 


D® x D® (gym) = D@ + D©, 


and from (27.6) the cpgrs transform like the symmetric products 
Cngers according to 


[D® + DO] x [D@ + D0] (sym) 
= [D® x D®] (sym) + [D® x D® + D®© x D2] (sym) + 
+ [D x D0] (sym) 
=[D® + D@ 4+ DM] + De® + Do, 
(27.9) 
This gives twenty-one independent base vectors, just equal to the 
number of elastic constants. 
We can now write down from (27.9) the representation that the 
ci give under some point-group of rotations (§ 16). Consider for 


instance the group 422, whose irreducible representations are given 
in appendix K. In (27.9), 


D becomes 2A, + A, + B, + B, + 2H, 
D™ becomes A, + B, + B, + E, 
D) becomes A,, 
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where these decompositions follow from (14.2) and (14.4). Thus the 
ey transform according to 


6A, + A, + 3B, + 3B, + 40. (27.10) 


The consequences of rotational symmetry 


Let us assume that the material under discussion has a group 
& of rotational symmetries. In the case of an isotropic material 
G is the full rotation and reflection group, and for crystals G is one 
of the point-groups of proper and improper rotations. By definition, 
a rotation & of the group moves the co-ordinate axes to a completely 
equivalent orientation, so that the properties of the material are 
the same expressed in terms of the two sets of co-ordinates. Thus 
if the tensor T represents some property of the material, the value of 
each component Tygr . . . of the tensor must be invariant under the 
group of rotations ©. This we shall refer to as the fundamental 
theorem of the present section. 

We shall now consider the linear combinations 


Di = cy y0y, + eyeye +... + XeeCeg (27.11) 


which transform as standard base vectors according to the re- 
presentations D‘*) in (27.9), or its equivalent like (27.10). We have 
two ways of looking at the transformation properties of the L,,'*). 
On the one hand they transform by definition according to D. 
On the other hand by the fundamental theorem each cy in (27.11) 
is invariant so that L,, is invariant. Now a set of non-zero 
quantities cannot transform according to two different irreducible 
representations at the same time (Theorem 3, appendix C). We 
therefore conclude (i) if D@ is not the identity representation 
(D® in (27.9) or A, in (27.10)), then Ly‘) =0; and (ii) if D@ 
is the identity representation, then Lm“) may be some non-zero 
constant. In the case of complete rotational invariance the identity 
representation D occurs twice in (27.9), so that (27.11) is a set of 
21 equations which determine the cy in terms of two constants. 
This is the well-known result that an isotropic medium has only 
two independent elastic constants, namely the bulk modulus and 
the rigidity modulus. 

A similar argument applies if we consider the case of a crystal 
with point-group symmetry 422. The identity representation A, 
occurs six times in (27.10). The elastic constants cj are therefore 
expressible in terms of six constants. There now remains the 
problem of actually determining the values of the cy in terms of 
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six suitable constants, in particular determining which cy are 
zero. This could in principle be done by following the present line 
of argument. The correct linear combinations (27.11) could be 
constructed by using the projection operator (14.lla) on one cy 
picked at random, and the equations (27.11) could then be solved 
for all the cy. This however would be extremely tedious, and 
instead we shall start again at the beginning and use the “direct 
inspection” method. 


The direct inspection method 


We saw in (27.6) that the elastic constant cpgrs transforms in 
the same way as the product pgrs, ©.g. Cazyz like xyz. In fact for 
convenience we shall use the products to represent the Cygrs. The 
order of the factors pgrs is important, except that because of (27.7) 
we may interchange the first two, the last two, or the first pair with 
the last pair, i.e. we put 


pgrs = gprs = pqsr = rspq. 


We shall follow the convention of alphabetic order in each pair 
pq and rs, and not list these equivalent combinations separately. 
The elastic constants cy; then transform like the following products: 


at, y4, 24, yzyz, weuz, cyxy, 
xPy?, a22?, xyz, x8z, wy, 
y?2*, yz, yPaz, yay, 22, 
zaxz, cry, Y2xz, YAY, Lexy. 


(27.12) 


The group 422 contains a 180° rotation about the z-axis, i.e. the 
transformation 
Z>-#, Yr-Yy 2% 


Under this transformation the product z*z becomes —2x*z. However 
by the fundamental theorem we require each product to be invariant, 
so that we have 

xz = 0. 


Similarly any product in (27.12) is zero if it contains z to an odd 
power. The two-fold rotations about the z- and y-axes eliminate 
products with odd powers in x or y. Thus all of the products (27.12) 
are zero except 


at, y4, 24, yeyz, xexz, cycy, wy®, 2°22, y2? (27.13) 
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We next consider the effect of a 45° rotation about the z-axis, 
u>y, y > —z2, Ze, (27.14) 


This gives the following relations: 
of = 74, YYZ = U2XZ, a al ar tal 


We now have three relations among the nine non-zero components 
(27.13), so that these are expressible in terms of six constants. 
This is the number already deduced from (27.10), so that there can 
be no more independent relations among the components, as can 
easily be verified by applying all the rotations of the group 422 in 
turn. The matrix cy of elastic constants therefore has the form 


C1 C2 Gs 09 0 O 
Cryo Cy, Cg 9 0 O 
Cz %g Cyg 0 O O 


0 0 0 cy 90 0 
0 0 0 O egy 0 
0 0 0 O O && 


(27,15) 


in terms of the six independent constants ¢,,, Cio, C13, Cag» Cagr Cag: 

This method of direct inspection is clearly applicable to working 
out the components of any tensor, provided we can choose a natural 
set of x, y, z-axes such that each rotation of the group sends a 
co-ordinate axis into a co-ordinate axis like in (27.14). Thus trigonal 
groups with 120° rotations about the main axis require special 
consideration (Fumi 1952b, c). 


Summary 

We have shown how the rotational properties of tensors are 
derived. From this we can determine the number of independent 
constants, in terms of which the tensor components can be expressed. 
It is also possible to determine the actual scheme of the com- 
ponents, which ones are zero and which non-zero ones are related. 


References 

General references: Fumi 1952a and Hearmon 1956, and refer- 
ences given there. The direct inspection method: Fumi 1952b, 
1952c. Elastic constants, including third order elastic constants, 
Hearmon 1956. 
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PROBLEMS 
27.1 The piezoelectric constants @gys are defined by the relation 


Pg = Ggrsers, (r, 8 Summed) 


where FP is the electric polarization in the material produced by the 
strain e,-s, and where g, 7, s run over the indices 2, y, z. Show that 
Ggrs = 0 for any crystal having a centre of symmetry. Determine 
the scheme of piezoelectric constants for a crystal with symmetry 
422, and check the number of independent constants by a second 
method. 

27.2* Determine the form of the matrix cy of elastic constants 
for a crystal having symmetry 32 (Fumi 1952c). 

27.3* Review the different methods that have been used to 
calculate the scheme of components in tensors subject to some 
point-group of rotational symmetry. Discuss their usefulness and 
their limitations. For references see Fumi (1952a). 

27.4* Most tensors like cy that represent properties of matter 
are either symmetric or antisymmetric. This is called their intrinsic 
symmetry. Review and relate the different types of argument that 
are used to establish the intrinsic symmetry of tensors and other 
quantities, e.g. conductivity, dielectric constants, the potential 
at A produced by a charge at B and vice versa, the elastic constants; 
and show how macroscopic reversibility is involved in the case of 
static properties. Discuss the statement ‘“Time-reversal symmetry 
is responsible for the intrinsic symmetry of matter tensors” (Fumi 
1952a, p. 740). . 

27.5 In which of the thirty-two crystal classes can a crystal 
exhibit (i) the piezoelectric effect and (ii) the pyroelectric effect? 
The piezoelectric effect is defined in problem 27.1. In the pyro- 
electric effect a crystal exhibits a permanent spontaneous electric 
dipole moment (Cady, 1947). 


Chapter VII 
NUCLEAR PHYSICS 


28. The Isotopic Spin Formalism 

Isotopic spin 

Since nuclei contain two quite separate types of particle, protons 
and neutrons, the wave functions describing them become one 
stage more complicated than the wave function for the electrons 
in an atom. The most obvious way of distinguishing which co- 
ordinates ry, oz, in a wave function % refer to which type of particle, 
would be to use always the co-ordinates k = 1 to p for describing the 
p protons, and the remaining k = p +- 1 to p + n for the n neutrons. 
However, such a system turns out in some ways to be rather clumsy, 
and a more convenient description is obtained by using an additional 
wotopic spin co-ordinate 7, = +4 to distinguish -whether a particle 
with orbital and spin co-ordinates r, oz is a proton (rz = +4) ora 
neutron (7z = —4). In a way we are considering protons and 
neutrons to be two states (tz = +4) of one “particle”, the nucleon. 
The fact that neutrons and protons are rather alike in their masses 
and interactions makes such a description particularly useful, but 
the isotopic spin formalism is in itself quite independent of any 
assumptions about the degree of similarity between them, e.g. as 
regards nuclear forces. 

Following the same argument as in § 11, we introduce two basic 
functions of the co-ordinate 72, 


€_(72) = 0, z= 4; 


= I, i Ste —4, 


(28.1) 


so that any function of 7, can be expressed in terms of these two. 
In particular if we multiply ¢, by 7z, we obtain the function 


Thus we can write 


TE, = $8,, t2f_ = —t¢_, (28.2) 
313 
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and consider 7z as an operator as well as a co-ordinate, and this 
can lead to no confusion just as in the case of the orbital variables 
x, y, 2 or the spin variable oz (problem 11.2). We also consider 
unitary transformations of ¢,, &_ 


TE, a fb ae (summed), 


which define 2 x 2 unitary matrices. Now all 2 x 2 unitary 
matrices with determinant +1 form the representation D/2) of 
the rotation group (cf. equation (8.24), and problem 8.13). It is 
therefore convenient to describe these transformations of £,, &_ 
using the language of rotations in a fictitious three-dimensional 
isotopic spin space with axes Ox, Oy, Oz. In these terms the trans- 
formations Iz, I,, I, defined by the properties 


Ip, = 36, Lyf, = HE, TE, = $8, 
Ir = $6,, Tyg =—HE,, LE =—té,  (28.8a) 


must be interpreted from (8.18), (8.23) as the infinitesimal rotation 
operators. In particular comparison with (28.2) gives I, = tz, and 
we shall in future write tz, Ty, 7, T- for Iz, Iy, Iz + Uy. Tt follows 
therefore from (11.18) and (8.29) as in § 11 that rz, ry, 7z transform 
like and have the properties of an angular momentum vector, called 
the isotopic spin, with the difference that we have dropped the A. 
From (28.3a) these operators are represented by the matrices 


nal wali af! 
nem a =|} |. (28.3) 


The interactions of nucleons can now be expressed in terms of 
these operators. E.g. the electric charge of a nucleon is 


q = (4 + re)e. 
For clearly we have 
gf, = e€,, gé_ = 0 
so that the expectation value of g is e for a proton state and 0 for a 


neutron state. The Coulomb repulsion between protons can then be 
written 


Ho = 2,2 ($ + tex)(d + T2)e?/rei (28.4) 


where the summation includes all nucleons. There is therefore no 


NUCLEAR PHYSICS 315 


difficulty in distinguishing protons and neutrons in the isotopic spin 
formalism when required. 
For more than one nucleon, we have 


Sy aa Lan id (28.5) 


as the components of the total isotopic spin vector T (cf. equation 
(8.34)). Wave functions for several nucleons can be written as 


= Pupy . ae. (ri; Oz1, Te, Oza,.. -)Ex1&8,08y8 eeey (28.6) 
apy... 


where «, B, y...is some arrangement of + and — signs (cf. equation 
(11.11b)). Using (28.3), (28.5) we can apply rotations in isotopic spin 
Space to these wave functions, and sort them out into ¢(7', M7) 
transforming according to the irreducible representation D‘T) of 
the rotation group { as standard base vectors (8.18). Thus the 
functions ¢,, €. transforming according to D/2) are characterized 
by the quantum numbers r = }, m, = +}. For any wave function 
¢ we have by operating with 7, that 


Up = 2 m,, = (p — n) = p —}A, (28.7) 


where A = p + n is the total number of nucleons described by ¥. 
Thus for fixed A, Mp is a measure of the nuclear charge pe. In 
any ordinary situation we know A and the total nuclear charge for 
a system, so that we can describe it by a wave function % with a 
definite value of Mr, though ys may contain components with various 
values of 7. In particular we can form pure isotopic functions 
&py as linear combinations of the ExEpofyg -... For instance for 
two nucleons we have 


Fir = Exess, Fo = 2-ME(E ee + E1E,9), 
Biia=Eabs, Fy = 2-/E ef .— Eaf,,). (28.8) 


The exclusion principle 


We shall now discuss how the exclusion principle is to be expressed 
in terms of our isotopic spin formalism, and start by considering 


t The quantum number 7 should not be confused with the vector T 
(V2, Ty, Tz). We shall use capital letters to denote total quantum numbers 
and operators for several nucleons, and lower case letters for individual 
nucleons, as in chapter IT and (28.5). 
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two nucleons, a proton and a neutron. Let the proton be in the state 
%(r, oz) and the neutron in ¢(r, oz). The combined state is 


Pp, Ozp)P(En, Ozn)- (28.9) 


and in terms of isotopic spin this can be written 


P01, o21)P(Le, Oza) rf» 
OF. Ye, ze) P(K1, O21) E4081, (28.10) 


depending on which particle we call number 1. Now these two wave 
functions (28.10), which mathematically are linearly independent 
functions, describe only one and the same physical state, namely 
(28.9), and not two separate degenerate states. Moreover the same 
state can also be described by any linear combination of the functions 
(28.10), for example by the antisymmetric one 


P = $(1)$(2)E ab» — W(2)P(1)E,2E 1, (28.11) 


where for short 4(1) = (11, o2,) etc. Clearly this redundancy among 
the wave functions (28.10), (28.11) is undesirable because it makes 
for ambiguity in writing down a wave function for a given state. 
We shall therefore make the convention that we shall always use the 
antisymmetric wave function (28.11), for there is always one, and 
only one, of these. 

This convention is also convenient in another respect. Using the 
spin functions (28.8), (28.11) can be written 


Y =Y(T=1, Mp=0) + Y(T=0, Ur=0) 
where 

Y(T=1, Mr=0) = 2-¥[f(1)$(2) — (2)$(1) Fo = P1410, 
Y(T=M p=0) = 2-M[p(1)P(2) + H(2)$(1)] Fao = PoFo.o- 
(28.12) 

Here we have separated out the 7’ =1 and 7 =0 components. 
Since permutations commute with isotopic spin rotations, permuta- 
tions cannot mix the two components with JT = 1, T = 0, each of 
which must therefore be antisymmetric as can easily be verified 
explicitly. Let us consider the function ¥4,,9. Operating on it with 
T., we obtain using (8.18) ‘ 
TPF) = 2H, 4,1, (28.13) 


which again must be antisymmetric. Since 7’, increase the value 
of Mp by one, by (28.7) it turns a given wave function Y(M7) 
into a new function ¥(Mry-+1) describing a state in which one 
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neutron has been turned into a proton, as is obvious from (28.7). 
Moreover the two protons described by (28.13) are seen to be in 
the same orbital and spin state ¥, as the neutron-proton pair in 
Y,5,,9. It is now clear what advantage there is in choosing F519 
antisymmetric by convention: it means the n-p state V,&,,, is 
directly and simply related by (28.13) to the corresponding p-p 
state, where the latter has to be antisymmetric because of the 
exclusion principle. 
The above argument is quite general. Let 


ply, 043 --. 5 Tp, op) 
and Pn(C psi» Sys) ++ + 5 Mpans Spin) 
be space and spin functions for p protons and n neutrons. By 
permuting co-ordinate numbers, the combined state can be des- 
cribed by up to (p+ 2n)!/(p!n!) different wave functions in the 
isotopic spin formalism. However, by convention we always choose 
the one and only antisymmetric function 


P=[(p+ n)tj-¥2 2, SpPbpdné ss ce €,,0$_,p41 cee E_,pin 


(28.14) 
where [(p + n)!]}-2 > dpP 
P 


is the antisymmetrizing operator of (12.2). The function clearly 
satisfies the requirements of the exclusion principle, namely anti- 
symmetry under the permutation of proton co-ordinates alone, or 
of neutron co-ordinates alone. In addition the function is anti- 
symmetric under permutations of proton and neutron co-ordinates 
in the sense of the isotopic spin formalism. This additional anti- 
symmetry, besides conferring a certain uniqueness on the function. 
has the following advantage. By operating on (28.14) with the 
operators T', and T_ we can generate new functions having different 
values of My, and hence describing different numbers of protons 
and neutrons, say p’ and n’. Simultaneously, the detailed require- 
ments of the exclusion principle have changed, since we now want 
antisymmetry in the first p’ co-ordinates instead of the first p, 
and in the last »’ instead of n co-ordinates. However, since permuta- 
tions commute with isotopic spin rotations, the new functions are 
still of the totally antisymmetric type (28.14) and thus satisfy the 
exclusion principle automatically. Therefore requiring the wave 
functions to be antisymmetric in everything prevents us from ever 
violating the exclusion principle in any calculation. The fact that 
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functions of the form (28.14) transform under isotopic spin rotations 
into functions of the same antisymmetry type allows us to sort 
them out according to the irreducible representations D‘7 of the 
rotation group. Then the set of functions 


wT, Mr) -T <Mr<T (28.15) 


describes a fixed number of nucleons in a fixed orbital and spin 
state, with the number of protons differing from state to state as 
given by (28.7) and with each state satisfying the exclusion principle. 


An alternative formulation 


We shall now sketch briefly an alternative method of setting up 
the wave functions. Apart from generally clarifying the isotopic 
spin formalism and being of practical use on occasions, this formula- 
tion is of interest in two connections. Firstly, there is the historical 
interest. From what has been said in this section, there is evidently 
a complete analogy between isotopic spin and the ordinary electron 
spin. In the latter case the exclusion principle demands anti- 
symmetry in the wave function for electrons with parallel spin, 
but for antiparallel spin the usual antisymmetry requirement is of 
a more formal nature and does not for instance prevent the two 
electrons being in the same orbital state. In § 12 we showed, using 
determinant wave functions following Slater (1929), how to work 
out the different terms arising from a configuration and write down 
their wave functions. However, before Slater’s method was dis- 
covered, the same results were obtained by the much more cumber- 
some method to be outlined below (cf. problem 28.4). It is described 
in detail in Wigner (1931) and Weyl (1931). 

Secondly, this formulation does shed more light on one interesting 
question which was already raised in § 12. In the case of atomic 
energy levels, the energy of a term is calculated purely from #orn 
(10.2) without any spin-dependent forces. Why then does the 
energy depend on the spin quantum number S? In the nuclear 
case the analogous situation is as follows. The isotopic spin only 
distinguishes neutrons from protons, and in the next section we 
shall assume that nuclear forces are, to a good approximation, 
charge independent, i.e. are the same for protons and neutrons, 
Then 7’ becomes a good quantum number, but how can the energy 
of a state depend on 7? 

As in (28.14) let 4, and ¢» be antisymmetric orbital and spin 
wave functions for p protons and ” neutrons. Then any of the 
(p + n)!/(p!n!) distinct functions Pp¢n also describe the same 
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- combined state, where P is any permutation of the p+ 2 co- 
ordinates. All the functions P,¢, form a vector space which is 
invariant under the permutation group P4 where A =p+7n is 
the total number of particles. Let us suppose this space has been 
reduced into irreducible representations 4”) of D4, with base vectors 
Ww, Consider now all the products €,;£s0¢,, .. . . of (28.6). These 
transform into one another under Py, and under isotopic spin rota- 
tions, and from § 15 we can therefore find linear combinations 
Ey;'™ arranged in rectangles so that they transform according 
to A under Py in each row and according to D) in each column. 
For instance for three nucleons we have the rectangles %, D(3/2 
and I’, D“/2), where ¥ and I are the representations of Table 3. 


3/2, ¥ 
“3/21 — Eb sefig 
BF — BE eof 4g + Ee big HE rb sokss) 
BF — 3-ve eof. + Et el g + Eure of 8) 
5738/2, F 
—3/e,1 = a ae ae 


ByET = 2-V(Ef.06 5 — ist of ss) 
aa = = 2- Uae sf bg €_1£ of19) 
Sues = ONE Ee 5 + Erb bis — 26 1,045) 
E_U2T = 6-2 fog — E abial 9 + 26 rE of 9) (28.16) 


Corresponding to any representation 4 of Py, we can define 
another representation 4°°")) by the matrices 


Ay ©°2)(P) = 8p[4y(P)]*, (28.17) 


where as before 5p = +1 as the permutation P is even or odd. 
Because of the 5p in 28.17, this is not just the complex conjugate 
representation 4*. For instance, for D3, % and (Table 3) are 
a conjugate pair, whereas [(©°)) is equivalent to I. If fi and 
gi ©°0)) transform according to 4) and 4cen), then from 
(28.17) and the unitary property (equation (A.20), problems C.2, 
C.1) we have that the linear combination 


> fige comp (28.18) 
i 


is antisymmetric. Now the orbital and spin functions %; represent 
definite numbers of protons and neutrons, so that to obtain a 
corresponding wave function in the isotopic spin formalism we have 
to multiply it by a & with the appropriate Mp (28. 7). However, 
among our rectangles we have a large number of 5’s with a given 
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Mr, and there is no compelling reason to choosing any particular 
one. This situation corresponds to the redundancy in (28.10), 
(28.11). However we can obtain a convenient unique function 


F => POSTON, (28.19) 
+ 


which by (28.18) is antisymmetric. By further detailed arguments 
it can be shown that each 7 is associated with only one A conj and 
vice versa, and that the representations 4 which turn up among the 
¥, are in fact such that there is a Aconj among the rectangles 
of &’s. Since we started with an arbitrary state yy» ¢n, we conclude 
that there is always in the isotopic spin formalism a totally anti- 
symmetric wave function. Hence stating the exclusion principle 
in the new form (antisymmetry with respect to all co-ordinates) 
does not eliminate any physically relevant functions, and gives us a 
unique one where there would otherwise be an ambiguity. 

We now return to the question of energies. Let us assume as is 
approximately true that the forces between all nucleons are the 
same, irrespective of whether they are protons or neutrons. The 
energy of a state then depends among other things on how close the 
nucleons can get to one another. This depends on the representa- 
tion 4 of the orbital and spin function ¥%%). For instance it was 
_ Shown below (12.6) that if 4 is the completely antisymmetric 
representation, then there is zero probability of finding two nucleons 
(protons or neutrons) at the same place. If on the other hand 4” 
is completely symmetric, the probability is a maximum. Thus the 
energy depends on ». However we do not usually compute X for 
our wave functions, but if we always work in the isotopic spin 
formalism with the totally antisymmetric functions (28.19), then 
A uniquely determines A conj, which in turn uniquely determines 7’. 
Thus we can conveniently use the quantum number T to distinguish 
levels which in general have different orbital symmetry and energy. 


Summary 


We have developed the isotopic spin formalism for writing a 
wave function for a system of protons and neutrons. A proton and 
a neutron are considered as two states of one particle, the nucleon, 
distinguished by the isotopic spin quantum number 7, = +4. 
This leads to the use of 7, Mp as total quantum numbers for a 
system. The exclusion principle in this formalism takes the form 
that a wave function must be antisymmetric under any permutation 
of the nucleon co-ordinates, the orbital, spin and isotopic spin 
co-ordinates being simultaneously permuted. 
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PROBLEMS 


28.1 Consider the operator P,, = —4(1 + 47,°7,). Show by 
giving 721, Tze all possible values that 


Pyo(11, G21, Tz» Y2, O22, Tz2) = —W(1, O21, Tae, V2, O22, Ter): 


Also derive this result by showing that T? = 1 — P,, and operating 
on the functions (28.8). Hence using the exclusion principle show 
that P,, operating on a wave function interchanges the orbital 
and spin co-ordinates of nucleons J and 2. 

28.2 Show that the operator P = 7,,7,_1,.7_2. has the eigen- 
value 1 when operating on a two proton state, and 0 otherwise. 
Hence use it to write down the Coulomb potential energy between 
nucleons. Reduce this operator analytically to the form (28.4). 

28.3 Show that [°) is equivalent to I’, where I’ is the re- 
presentation of Table 3 of the group P,. In (28.17), show that the 
conjugate of 4°) is just 4%. Formulate generally in terms of 
group characters the condition under which 4¢ ©") is equivalent 
to A). 

28.4 Consider the electron configuration (2p)?. Write down all 
possible orbital functions for the three electrons and sort them into 
rectangles (L, M7; ,A, 1) whose rows and columns transform accord- 
ing to DY) and A under rotations and P,. Do the same for the 
spin functions U(S, Ms; », 7). By multiplying all orbital by all 
spin functions, write down what terms one would naively expect 
from this configuration on the basis of § 11 (neglecting the exclusion 
principle). Using (28.18) determine which are allowed by the 
exclusion principle. Also prove that the exclusion principle never 
excludes part of a term, but always allows or excludes the whole of it. 


29. Nuclear Forces 


. Charge independence 


There are several kinds of force acting between the nucleons in 
a nucleus. The most important ones constitute a specifically nuclear 
force and are referred to as the “strong interactions’. Next in 
importance comes the Coulomb repulsion between the protons. 
. This is considerably smaller: for instance the strength of the nuclear 
potential is of the order of 25 MeV (Blatt and Weisskopf 1952, 
p. 54), whereas in an alpha particle the Coulomb energy between 
the protons at a separation of 2.10-15 cm is 0-7 MeV. Then there are 
effects due to the mass difference between a neutron and a proton, 
and interactions involving their magnetic moments. These are 
all thought to be due indirectly to electromagnetic effects and, 
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together with the Coulomb force, constitute the “electromagnetic 
interactions’. Finally there are the “weak interactions’ which 
couple nucleons with electrons and neutrinos, and which are respon- 
sible for beta decay (§ 33). In this section we shall only consider 
the strong interactions, and often refer to them loosely as the nuclear 
forces. 

These strong interactions appear to have at least two symmetry 
properties. The first one is that they are invariant under the space 
inversion IT (3.11), so that to a very good approximation one can 
associate a definite parity with each nuclear level (Lee and Yang 
1956). The more important second property of the strong inter- 
actions is that they are independent of the charges of the particles, 
ie. the forces between two protons (p-p), two neutrons (n-n), and 
neutron and a proton (n-p) are all the same. The evidence for the 
equality of p-p and n-n forces comes from the great similarity 
between “mirror” nuclei like H? and He’, which differ only by the 
interchange of protons and neutrons (Bethe and Morrison 1956, 
pp. 9, 116). The equality of p-p and n-p forces is indicated by 
p-p and n-p scattering experiments (Bethe and Morrison 1956, 
p. 97), by the cross sections for the reactions 


p+p->a+D 
ptnor+D 


(§ 30; also Henley e¢ al. 1953), and by the energy levels of nuclei as 
discussed below. In addition there appears to be no experimental 
evidence of any other kind contrary to this view. 

The charge independence property of the strong interactions can 
easily be expressed in terms of the isotopic spin formalism of the 
previous section. Clearly if the Hamiltonian does not involve 
Tz, Ty, Tz, then it acts equally between any pair of nucleons. How- 
ever such a form for the Hamiltonian is unnecessarily restrictive. 
It is sufficient that the Hamiltonian be invariant under rotations 
in isotopic spin space. Thus it could for instance involve the term 
%,°%,. For consider a complete set of functions ¥(«, 7, Mr) for 
a system of A nucleons, where « denotes all the other quantum 
numbers besides 7', Mr necessary to specify the function. If #.4 
(si = strong interactions) is invariant under isotopic spin rotations, 
then the functions #si}(«, 7, Mr) transform just like the #’s 
according to the irreducible representation D‘T), Thus from the 
matrix element theorem of § 13, we have that 


[yer 7) si Yla,T, Mr) = Spr’ Sym! a(a,x’,7') 
(29.1) 
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is non-zero only when T = 7”, Mr = Mr’, and then does not 
depend on Mr. Now the (a, 7, Mr) for different values of Mr 
refer to states in which the nucleons always have a particular 
orbital and spin wave function, but which differ in the number 
of protons (28.7). The result (29.1) then indicates that the Hamil- 
tonian #,; acts the same way on all these states independent of 
the number of protons, and therefore satisfies our condition of 
charge independence. 


Form of the interaction 


We have just proved that if #,; is invariant under rotations in 
isotopic spin space, then it merits the description “‘charge-indepen- 
dent”. However, we know experimentally that #5; is charge 
independent and we want to express this in the isotopic spin forma- 
lism. I.e. we need to prove the converse of the above, namely, that 
A; being charge-independent, is invariant under isotopic spin 
rotations. To do this we have to consider the form of all possible 
types of interaction. Let B be any operator and let 


BE, = ag, + bé_ ’ 
Bé_ = cf, + dé_. (29.2) 
The matrix of coefficients can be expressed as a linear combination 
of the matrices (28.3b) plus the unit matrix, and correspondingly 
the operator B can be written 
B=}ta+d) +er, + br + fa — d)rz. (29.3) 


At this stage we limit ourselves to interactions involving only pairs 
of particles. It then follows from (29.3) that we may restrict our- 
selves to linear combinations of the following 16 terms. 


T1+ + T24> 1 - Ta, 714.7245 T1~.T2—5 
714722 — T1zT24; T1_Tez + TizT2_; (29.4a) 
Tiz — T2zs 4(714.72_ —— T1724), Tz + T225 TizgT22, 
(29.4b) 
1, $714.72. + T1724) + T1zT22 = T° Te. (29.4c) 


Now nuclei have a definite charge, corresponding to definite num- 
bers of protons and neutrons. From (28.7) this means that Mp 
is a good quantum number for designating the eigenstates of #1. 
Thus #3; is invariant under isotopic spin rotations about the z-axis, 
so that it cannot contain any of the terms (29.4a). This argument 
of course does not apply to the weak interactions leading to beta 
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decay, for these couple states with different charge and Mr. Next, 
by operating with each of the interactions (29.4b) on wave functions 


Y = $n, oa, 12, 022)51;m, with Mr =1,0, —1, 


it can be seen that they give quite different results for different 
values of M7 and hence cannot be considered charge-independent. 
For instance 


(nz +re)YV=PY ifMyp=1 (p-p), 
—0 ifMy=0 (p-n), 
= — WY if U7 = —1 (n-n). 


It can also be shown that no linear combination of these four inter- 
actions can be used in a charge-independent Hamiltonian (problem 
29.2). This only leaves the two terms (29.4c), which are invariant 
under isotopic spin rotations and which give charge independent 
forces as already shown in the last paragraph. We have therefore 
proved that the conditions of charge independence and invariance 
under isotopic spin rotations are equivalent. 

We can now easily deduce the actual form of the Hamiltonian 
if we first make the assumption that #.; does not depend on the 
momenta (nor the velocities nor angular momenta) of the particles. 
There is no compelling reason for this assumption; it is made here 
for the sake of simplicity in the absence of any definite evidence 
to the contrary (Bethe and Morrison 1956, p. 99). As in (29.4) we 
need only consider the 16 products up to the second degree of the 
spin operators 841, 82 with the identity operator. Using the irreducible 
representations of the rotation group these can be classified into 


two scalars, 1, 8,°S.3 (29.5a) 
three vectors, S, + S,, 8; — 8,, 8, A 83; (29.5b) 
one tensor, five components tm, —2 S m S 2. (29.5c) 


As regards the orbital variables, #4: can only depend on r =r, 
—r,. The functions of this can be written as 


the scalar, 1; (29.6a) 

the vector, r; (29.6b) 

the tensor, Um = 17? Vo m(8, $); (29.6c) 
etc. 


all multiplied by an arbitrary function of r. The tensors in (29.5c), 
(29.6c) are not complete nine-component second rank tensors, but 
are the five parts transforming irreducibly according to D‘?). These 


NUCLEAR PHYSICS 325 


form a symmetric tensor with zero sum of diagonal terms (see § 9). 
Now the Hamiltonian has to be invariant under rotations (there is 
no intrinsic preferred axis among the nucleons), and the only way of 
obtaining such interactions is by taking the scalar products of 
corresponding terms in (29.5) and (29.6). We obtain the following 
six scalars and pseudoscalars: 


1, 81 °S», Ltm*tm’ = 3(8,°T)(S_°T) — r78,°8,, (29.7a) 
(S, + S,)-r, (8, — S,)°r, 8, A 8,°Fr. (29.7b) 


The three terms of (29.7a) can each be multiplied by the two terms 
of (29.4c) giving the six interactions of Table 32. These are all 


| 
| TABLE 32 
Nuclear Interactions 

Wigner Vw(r) 
Bartlett (spin exchange) Va(r)s, . Ss 
Tensor Vi(r)(s, . r)(S, . FT) 
Heisenberg (space-spin exchange) Vau(r)t, . Ts 
Majorana (space exchange) Vur(r)(S, . Sg)(T, - T:) 

| Tensor exchange Vie (r)(Sq - T)(Sq . F)(T, . Ts) 


invariant under the space inversion /7, symmetric in the co-ordinates 
1 and 2, and charge independent. On the other hand the terms 
(29.7b) must be rejected because they are not invariant under 
IT (8,, 8, are invariant under J7, appendix F). 


Isotopic spin multiplets 


Since we have established that #,; is invariant under isotopic 
spin rotations, its eigenfunctions can be sorted out to transform 
according to the irreducible representations D'™). The set of 2T + 1 
states have the same energy and from what is called an isotopic 
spin multiplet or 7’-multiplet. Because of the different values of 
Mr, these states clearly belong to neighbouring nuclei with the 
same mass number. Fig. 44 contains two examples of a 7'-multiplet, 
one with 7 = 1, J =2 and one with 7 =1, J =0. The three 
states of each multiplet are seen to have not exactly the same 
energy because of the minor charge-dependent electromagnetic 
interactions mentioned at the beginning of this section. Some of 
them can easily be corrected for’, For instance the mass difference 
between a neutron and a hydrogen atom amounts to 0-79 MeV. 
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If we assume the protons in the nucleus to be uniformly spread 
out, the Coulomb energy is (Blatt and Weisskopf 1952, p. 216) 


Ec = 32(Z — lje*/R, (29.8) 


where FR is the nuclear radius. Using (29.8) and R = 1-44 A 
10-13 em, we obtain for the corrected relative energy levels of the 
J =0 T-multiplet 1:97 (Mp = —1), 1-77 (Mr =0), and 1-96 
(My =1). The remaining discrepancy is presumably due to the 


J=0, T= 


MeV 


Energy, 
nm 


(0-72)¢ V=l, 720 
) (0-00)° J=3,7=0. 


Fie. 44. Relative energies of the ground states and first excited 

states of the nuclei Be!®, B19, C1!°, The mass energies of the atomic 

electrons and the nucleons is included. The parity of all states is 
even. 


crudeness of (29.8) and other small charge-dependent effects. 
Nevertheless the degree of agreement shows that the strong inter- 
actions must be charge-independent. 


The shell model and nuclear levels 


In the theory of atomic energy levels (Chapter IT), it was seen to 
be a useful starting approximation to assume that each electron 
moves in a fixed net potential, which represents in some average 
way the effect of all the other electrons and the nucleus. Similarly 
in nuclei it is fruitful to assume that each nucleon moves approxi- 
mately independently in a certain potential well, and this leads to 
ls, 28, 2p, 38, etc., orbits as for electrons. As in the case of jj coupling 
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in atoms (§ 11), the presence of fairly strong tensor and/or spin- 
orbit forces couples the orbital and spin angular momenta of each 
nucleon to give a total angular momentum j (Bethe and Morrison 
1956, pp. 99, 165). However the order of the energy levels of the 
various states is different from that in the electronic case, and this 
also results in a different grouping into closed shells. The order 
in which the various individual nucleon levels are filled is shown in 
Table 33, where levels with nearly the same energy have been 


TABLE 33 
Levels in the Nuclear Shell Model 


Shell Levels Number Cumulative 
of states total 
1 184/3 2 2 
2 2Pejer 2P 1/2 6 8 
3 3d5/q, 2813/9, 3dg/q 12 20 
4 4fasa 8 28 | 
5 Spare» Af sins 3Pi/a» OFo/s 22 50 


The last column gives the total number of states up to and including a given 
shell. After the fifth shell differences appear between protons and neutrons, 
and between various authors (Bethe and Morrison 1956, p. 164; Siegbahn 
1955, p. 421). 


grouped together into shells. The total number of states in each 
shell is also shown, the level 2,,. being for instance 27 + 1 = 4- 
fold degenerate. Each of these states can be filled twice in a nucleus, 
- once by a proton and once by a neutron. 

As an example, let us calculate J and 7' for the low lying allowed 
levels of the A =~ 10 nuclei Be!®, B!°, C19, From Table 33 the 
configuration is (1s,/,)4(2p5/.)®. By writing down all possible deter- 
minant wave functions for this configuration in the manner of § 12, 
we can tabulate all the allowed values of 7, M7 and hence group 
them into J, 7 multiplets. As in § 12, the closed shell (18,,.)4 always 
gives zero contribution to My, Mr and can therefore be omitted. 
Also (24/.)® gives rise to the same levels as (2p5/.)", because together 
they form another closed subshell. Abbreviating the wave function 
2-1/2 SpPb(j = 3/2, 41, 71;,11, 071; TW — 3/2, Myje, Mr2;T2, 22, T2z2) 
to (mj, Ma = +)(mjs, Mrz = +), we obtain analogously to Table 5 
the values of Mz, Mr shown in Table 34. These states can be grouped 
into levels with 


Jee Teale J HOF al Je 3, 7H 0e JH 1,7 =; 
their energies being as shown in Fig. 44. Thus the J = 3 state can 


22 
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TABLE 34 
States in the Configuration (2p5/.)? 


Wave function 


(3/2 +) (1/2 +) 
(3/2 +) (—1/2+) 
(3/2 +) (—3/2 +) 
(1/2 +) (—1/2 +) 

(3/2 +) (3/2 —) 

(3/2 +) (1/2 —) 

(3/2 —) (1/2 +) 
(3/2 +) (—1/2 —) 
(3/2 —) (—1/2 +) 
(3/2 +) (—3/2 —) 
(3/2 —) (—3/2 +) 

(1/2 +) (1/2 —) 
(1/2 +) (—1/2 —) 
(1/2 —)(—1/2 +) 
and corresponding wave functions with 
t (a) J=2,T=1; (b)J= 
(ec) J = 3,7 =0; (d) J= 


levelt 


© 


corcorrnwuwoorn & 
eecooooooorene § 


Bomarronpeneooge 


nega — ae My 
0,7 
1v 


only apply to B® and not to Be!®, C!°, because it is an isotopic 
singlet (M7 = T = 0). On the other hand the J = 0, 7 = 1 level 
applies to all three nuclei Be!®, B!, C10 (M7 = —1, 0, 1), though 
the energy is not exactly the same for all three due to the Coulomb 
energy (29.8) and the mass difference as 3 already discussed. As in 
(11.24) the parity is given by 


and is even for all the states mentioned above. 


The deuteron 


For a system of two free nucleons, the only force acting is the 
interaction between them and this depends spatially only on the 
difference r = r, — r, of their co-ordinates. Thus the wave equation 
becomes “separable” in terms of the centre of mass co-ordinate 
R = }(r, +1.) and r, and the wave function can be written 


(Schiff 1949, p. 81) 
PalR)P(r) Uy Maps £08, (29.9) 


or rather as a linear combination of such wave functions. Here 
#a(R) gives the motion of the centre of gravity. Instead of the 
form (29.9), it is convenient to introduce the isotopic spin functions 
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(28.12) S744, with J = 0,1 and the corresponding spin functions 
Us uy, With S = 0, 1, and thus to use the set of wave functions 


~ = $4(R)d(k) (29.10) 
where a(k) = b(r)Us ums2r.Mp- (29.11) 


We have used the symbol d (for deuteron) in (29.11) to denote that 
d(k) is an internal state of the deuteron. We now assume that in 
the ground state of the deuteron the orbital function ¢;(r) is a 
spherically symmetric 1s type of function ¢,.(r) with Z = 0. Then 
to make (29.10) antisymmetric, we must either use S=1, 7 =0 
or S=0, T =1. This gives two states for the deuteron, J = 1 
(7 =0) and J = 0 (7 = 1). The former one has the lower energy 
and is the usual ground state. The J = 0, 7’ = 1 state is actually 
completely unstable, as we would expect from the fact that two 
protons or two neutrons do not form a bound system. 

So far we have designated the ground state of the deuteron by 
the quantum number J = 1, but if the wave function were precisely 
of the form (29.11) then DL and S would separately be good quantum 
numbers. This would be so if the strong interactions did not include 
any tensor interactions (Table 32) nor spin orbit coupling, since 
the other interactions are invariant under separate orbital and spin 
rotations. In particular we would have L = 0, and this would 
imply a zero quadrupole moment <Q> for the deuteron with respect 
to its centre of mass 


<Q> = | ¥*Qy do 
where = Q@ = 3 [8(, — Z)® — (r, — R)*b + 721) + 
+ HB(2q— Z)— (r,—R)*b+ 722) (29.12) 
= 2(32* — (1 + Tr) 


(Bethe and Morrison 1956, p. 104). However it is known experi- 
mentally that the deuteron does have a quadrupole momient, as 
we would expect in general for a J = 1 nucleus from the considera- 
tions of § 21. This can be explained by assuming that #; contains 
some tensor (or tensor exchange) interaction tens (Table 32) 
which we shall treat as a perturbation. The unperturbed wave 
function d(1s) then becomes 


Ey a — By—Ey 


Since # tens is invariant under simultaneous orbital and spin rotations 
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and under isotopic spin rotations, from § 13 all d(&) in (29.13) must 
have J = 1, T = 0 like d(ls). Since tens transforms according 
to D‘ under orbital rotations or under spin rotations, the coefficient 
of d(k) in (29.13) is zero unless d(k) has in addition Z = 2, § = 1, 
2 or 3. In the present case of only two nucleons only S = 1 applies. 
Thus the ground state d of the deuteron can be written as the sum 
of terms (Ms = +1, 0) of the form 


b4(7)0y ms%o,0 and balt) Yo,m(9,9)U 1 mgHo0, (29.14) 


where ¢$,Y>,1 is a d-symmetry (/ = 2) orbital. In fact it can be 
shown that this form applies exactly, not only in first order pertur- 
bation, if ¢s is interpreted as an arbitrary | = 0 orbital differing 
somewhat from the original ¢1s (problem 29.8). Now from (29.12) 
the quadrupole moment of the ground state is a linear combination 
of terms 


[teQ¢aYesedv and | ba*QbaY2,m dv, 


and does not vanish identically. 

These conclusions about the wave function of a deuteron can 
be summarized conveniently as follows. The final wave function 
can be written in the form 


¢ = ba(R)dm, (m = My = 0, +1) (29.15) 


where 45,(R) describes the motion of the centre of gravity R, and 
dm is a function of the form (29.14) describing the internal motion. 
We also have that d, transforms according to D under orbital 
and spin rotations (J = 1), according to D®) under isotopic spin 
rotations (7’ = 0), and has even parity, which completely describes 
the symmetry of (29.15). Now given this symmetry, we do not 
need to consider for most purposes the detailed internal structure 
of the deuteron. We can regard it as if it were a new type of funda- 
mental particle described by the “spin” functions dm, where the 
three states m = 0, +1 refer to some internal degree of freedom. 
This is why we have used small letters d and m in (29.15), in accor- 
dance with the usual convention of using small letters for single 
particles and capitals for composite systems. From this point of 
view the deuteron is often described as having a “‘spin’’ of 1, although 
the quantum number J = | is really determined by a combination 
of internal orbital motion and true spin angular momentum.f 

} If there is likely to be confusion with the electronic angular momentum, 


then J is used instead of J for the total internal angular momentum, i.e. 
“gpin’’, of a nucleus or other compound particle. This is done in § 21. 
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The same argument can be applied of course to any nucleus, and 
we have already made extensive use of this in § 21 where we treated 
the nucleus as a black box characterized by a certain net “spin”, 
magnetic moment, quadrupole moment, etc. There is thus a com- 
plete analogy with electrons described by u,, or nucleons by é,,, 
the only difference being that in the case of the deuteron we happen 
to know explicitly how the internal degree of freedom describes 
the relative motion of the proton and neutron. This internal motion 
is described by saying that d, contains a mixture of orbital states 
EL =0 and L = 2, and is an eigenfunction} of 8? corresponding to 
S=1. 


Other particles 


As already mentioned the same type of description (29.15) can 
also be applied to other particles. For instance the internal wave 
function « of an alpha particle has J = 7' = 0 as expected from the 
shell model. Thus we can write for an alpha particle wave function 


where #,(R) corresponds to the motion of the centre of gravity R, 

and where the function « is invariant under orbital and spin rota- 

tions and under isotopic spin rotations, and has even parity. 
Similarly for 7-mesons we write 


$= Ym(R)7™, m=0,+,-, (29.17) 


where zrt, 7°, 7 denote the internal wave functions of the corre- 
sponding 7-mesons. Experimentally it is known that these particles 
have zero intrinsic spin angular momentum, so that the functions 
a™® are all invariant under rotations. Also [J7™ — —z™ where IT 
is the space inversion, i.e. the 7™ all have odd parity. 

It is also known experimentally that z-mesons are intimately 
connected with nuclear forces (Bethe and Morrison 1956, pp. 29, 
147). In particular it is found that the forces between 7-mesons and 
nucleons have about the same range and strength as inter-nucleon 
forces, and that they too are very nearly charge independent. It is 
natural therefore to look for a unified interaction theory which 
embraces 7-mesons as well as nucleons. We achieve this by regard- 
ing the three types of 7-meson as three different internal states of 


ft However the d» do not transform according to D® under spin rotations 
and Ms is not a good quantum number. 
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one particle described by the isotopic spin co-ordinate 7, the func- 
tions 2+, 7°, w- transforming as standard base vector (8.18) with 
Mr =1, 0, —1 and 7 =1 under isotopic spin rotations. The 
strong interactions can again be expressed in terms of the 7 operators 
and thus again become charge-independent if they are invariant 
under isotopic spin rotations. 

Similarly the isotopic spin formalism can be extended to all 
particles which interact via the strong interactions and Table 35 


TABLE 35 
Quantum Numbers of Particles Having Strong Interactions 


Particle Mass Spin Parity 7 My N Strange- 
ness 

Nucleons 

Proton 1840 4 even $ $ 1 0 

Neutron 1840 4 even t —4$ 1 0 

Antiproton 1840 $ odd 4 —4 —1 0 

Antineutron 1840 4 odd $ + —l1 0 
Mesons 

at, 1°, 7 270 0 odd 1 1,0, —1 0 0 

Kt, K°, 970 0 ? 4 +4 0 1 

R°, K- 970 0 ? 4 +4 0 —1 
Hyperons 

Ae 2180 4 ? 0 0 1 | 

z+, D°, J- 2330 4 ? 1 1,0, —1 1 —1 

8°, S- 2580 4 : 3 +4 1 —2 


The mass is in units of the electron mass. 


shows the values of 7 and M, for the wave functions describing 
some of these particles. In addition it has been found that there 
are other useful quantum numbers describing the particles, namely 
the nucleon number NW and the strangeness. These are presumably 
connected with an invariance property of the Hamiltonian under 
some transformation involving a new co-ordinate not yet under- 
stood. In each case the charge of the particle is 


e[Mr + 4N + }(strangeness)], 


this being the generalization of (28.7). The concept of isotopic 
spin does not appear to apply to electrons, positrons, neutrinos 
and » mesons (collectively known as leptons) which do not interact 
via the strong interactions. 
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Summary 


The strong interactions among nucleons and other fundamental 
particles have been discussed, particularly the consequences of their 
being charge-independent. The form of the Hamiltonian, the 
existence of isotopic spin multiplets, and the states of the deuteron 
and other particles have been described. 


PROBLEMS 


29.1 Suppose that nuclear forces are only charge symmetric 
but not necessarily charge independent, i.e. the n-n and p-p forces 
are equal to one another but not to the n-p force. Which of the 
interactions (29.4) would then be allowable in the Hamiltonian? 

29.2 Operate with each of the interactions (29.4b) on the &’s 
(28.8), and show that no charge-independent linear combination 
can be formed from them. Note that it is necessary to use the 
conditions (i) that the Hamiltonian must be symmetric as regard 
interchange of particle co-ordinates, and (ii) that the linear combina- 
tions must contain all real coefficients so that the Hamiltonian is 
Hermitian (Schiff 1949, p. 129). Also list critically all the assump- 
tions that have been used in arriving at Table 32. 

29.3 Suppose that the nuclear Hamiltonian is invariant under 
Space inversion, but is not necessarily charge-independent. Which 
of the following interactions would be allowed? (8, + s8,) A p-r, 
S) °T1 + Se° Ta, (Si A S2'r)(T1* Te), [(81 — S2) A P* Fo(t1z — 722)] 
8, °Sol(7147) — 74, 7.,), Where r=r,—Tr, p =p, — Po. 

29.4 Are the electromagnetic interactions invariant under 
isotopic spin rotations about the z-axis? 

29.5 Derive the J, T and parity quantum numbers for the energy 
levels of Li’, Be®, B® arising from the lowest configuration. 

29.6 Assuming the Hamiltonian is charge independent and 
invariant under space inversion, show that for two nucleons the 
energy eigenfunctions are also eigenfunctions of S? where S is the 
total spin angular momentum. 

29.7 Assuming that the Hamiltonian contains only the Wigner 
and Heisenberg interactions (Table 32), what would the sign of 
Vu(r) have to be to account for the deuteron energy levels, 

29.8 Show that (29.14) gives the general form of the deuteron 
ground state wave function in the presence of tensor interactions. 
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Using Wigner coefficients, write down the correct linear combina- 
tions of the terms (29.14) corresponding to eigenstates with m 
= M, = 1,0, —1. 

29.9 Suppose that the nuclear Hamiltonian contains the spin- 
orbit term Vzs(r)(r, — f2) A (Py — Pe) * (8; + 82) and the inter- 
actions of Table 32 excluding the tensor ones. Would this account 
for a quadrupole moment of the deuteron in the ground state? 

29.10* Derive all the nuclear interactions that are allowed by 
symmetry (excluding the assumption of charge independence) which 
are independent of, or linear in, the momenta of the particles 
(Eisenbud and Wigner 1941). 

29.11 Several nucleons are interacting via charge-independent 
forces. It is asserted that therefore the charge density in the 
resultant nucleus must be equal to eZ/A times the particle density. 
Either prove this assertion, or discuss why it is incorrect. 


30. Reactions 


In the previous two sections we showed how the symmetry pro- 
perties of the eigenstates of nuclei are discussed, especially as 
regards isotopic spin. We shall now apply these concepts to reactions 
when the nuclei take part in time-dependent processes. In particular 
we shall illustrate how symmetry properties can be used to deter- 
mine selection rules for reactions, can be used to discuss the angular 
distribution of the reaction products, and to calculate the ratios 
of cross sections if we have several related reactions. 


Parity of the 7 meson 


As a first example to illustrate the kind of problem that has to be 
discussed, we consider the reaction 


am +d—>n-+n, (30.1) 


and use it to determine the parity of the 7~ meson. What happens 
in practice is that the 7~ meson is stopped in some material con- 
taining deuterium. Being negatively charged, it can go into stable 
electron-like orbits about the deuteron, quickly falling down to the 
lowest 1s level. The radius of this orbit is 273 (= m,/m,) times 
smaller than the corresponding electron orbit because of the mass 
difference. Thus the meson spends quite an appreciable fraction 
of the time inside the nucleus, during which the reaction (30.1) can 
occur. The initial wave function before the reaction can be written 
in the notation of (29.15), (29.17) 


Yi = Wy2([Pa — Ta|)r-dm. (30.2) 
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Throughout this section we shall use suffices i and f to stand for 
““nitial’’ and “‘final’’. Since the 7-meson has zero spin (Table 35) 
and the deuteron has spin 1, ¥; transforms according to D™ under 
rotations. The deuteron has even intrinsic parity (see previous 
section), so that the initial parity is 


UW Was (30.3) 


where w, is the intrinsic parity of the 7-meson which we want to 
determine. Let us now consider the two neutrons in (30.1) and 
write down a set of basic wave functions 


P = dir, — te) 5 ye Erb-2 (30.4) 


to describe them, where #; transforms according to D under 
rotations, and where S = 0 or 1. To make Y antisymmetric, we 
must have either / = even, S = 0 (ie. % symmetric, Usy, anti- 
symmetric), or 1 = odd and S = 1. Using the spectroscopic nota- 
tion of § 11, we have therefore the states 4S, ?P 91 or 2, Des *Fa.3 or 4; 
etc., and an arbitrary J = 2 state for instance would be a linear 
combination of the *P,, 1D., °F, states. We now assert that the 
final J and parity after the reaction must be the same as the initial 
ones. Thus the initial value J = 1 limits us to the final state °P,. 
This has odd parity since J = 1, so that from (30.3) the 7~ meson 
has odd parity. Clearly the crux of this argument is the assertion 
that the J and parity do not change during the reaction. To prove 
this we first have to establish the following theorem. 


Fundamental theorem 


Consider an atom in a box containing also some electromagnetic . 
radiation, and consider two energy levels of the atom #, and £, 
with wave functions ¥, and ¥,. We recall (Schiff 1955, p. 28) that 
a stationary state of an isolated system depends on the time only 
through a factor exp(—tHt/h) and is an eigenfunction of the Hamil- 
tonian. Now y, is an eigenfunction of the atomic Hamiltonian, but 
it is not true to say that having the atom in the state ¥, gives a 
stationary state for the whole system because the atom may absorb 
(emit) a quantum of radiation and make a transition to 4,. To 
obtain a stationary state we would have to take a linear combina- 
tion of states with the atom in , and N photons, and with the 
atom in %, and N — | photons in the box. 

Similarly we could write down a Hamiltonian #; for the initial 
three particles in (30.1) (7~, proton, neutron), using two-particle 
interactions of the form of Table 32 and the Coulomb force, and we 
would presumably find (30.2) to be a satisfactory eigenfunction. 
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However such a Hamiltonian H, is incomplete, and the initial 
state (30.2) is not really a stationary state of the whole pion- 
nucleon field, as shown by the fact that the reaction (30.1) is observed 
to occur. #’; is particularly inadequate in that it refers to a fixed 
number of particles, namely three, and cannot describe processes 
involving annihilation or creation of particles. Thus we require a 
field theoretical Hamiltonian that can. How this is achieved can 
be seen from the Coulomb repulsion. The coulomb interaction 
between two particles is usually written e,¢,/r,., but by using the 
electric field & we can express this electrostatic energy for several 
particles in the form (1/87) [2 dv while leaving the number of particles 
completely unspecified. In the following we shall assume all Hamil- 
tonians to be of this very general field-theoretical type. Further- 
more in this picture a reaction is described, not by a sudden change 
of the wave function from one type to another, but as the continuous 
development of one wave function under such a general Hamil- 
tonian. 

We now prove the following theorem. If # is invariant under 
a group P of transformations T (not involving time), and if a set of 
wave function y,,(£) transforms according to the representation D,,(T) 
at some time t = ty, then the y(t) transform according to the same 
representation D,,(T') at all times t. Instead of giving a field theoretical 
discussion as we should, we shall just use the time-dependent 
Schrédinger equation 


Fy 
|x (q) — th A #.(q, t) = 0, (30.5) 


where g represents all the co-ordinates of the system besides ¢. 
Now the operator in (30.5) is invariant under &, and the eigen- 
functions ¥,(q, t) with eigenvalue zero can be sorted out to trans- 
form according to irreducible representations of G or any other 
convenient way, e.g. according to D,,(7'). Since the transformations 
do not affect ¢, it is just a parameter having any value from — oo 
to oo. The transformation properties thus remain constant in time, 
which proves the theorem.} Two variations of this proof are indicated 
in problems 30.1 and 30.2. As an example, in the reaction (30.1) 
the Hamiltonian is invariant under rotations and space inversion. 


f An even more direct way of seeing the theorem is to note that the solution 
of (30.5) can be written 


bul t) = expli(t — ty) Wyulg, to). 
The exponential is invariant under G, and so #u(q, t) transforms in exactly 
the same way as ¢,(q, ty). 
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Also the initia] state is one of definite J and parity, so that these 
quantum numbers cannot change during the reaction, which is 
what we asserted. 

This theorem is often expressed in the following somewhat 
weaker form: if an operator commutes with the Hamiltonian, tt is 
a constant of the motion. This follows from equation (17.1) 


a = (1/ih)(Ax? — #A). 


It can be related to the earlier form of the fundamental theorem 
by noting that all symmetry transformations commute with the 
Hamiltonian and are thus constants of the motion (see equation 
(17.5)). The weaker form of the theorem is quite adequate for many 
applications, e.g. the reaction (30.1) already discussed. Here the 
angular momentum operators J? and Jz, and the parity operator 
II commute with # and are therefore constants of the motion. 
Thus the initial and final wave functions are designated by the 
same values of the quantum numbers J, My, and w. 
Consider now the reaction 


Mt+a>d+ Mp, 


called an (a, d) reaction, where Ny, Nz are the initial and final 
nuclei. The initial wave function is 


Y, = by he (30.6) 


where we really should include an antisymmetrizing operator 
[a + p)!}-¥/? & 8pP as in (28.14) but this does not affect the other 
transformation properties. Since «-particle and deuteron functions 
are invariant under isotopic spin rotations (see equations (29.15) 
and (29.16)), we have 7’; = 7; where these are the 7 values of the 
nuclei, i.e. we have the selection rule 


AT = 0 for (a, «), (a, d), (d, «), (@, a). (30.7a) 


Similarly («, «), (d, «), (d, d) reactions have the same selection rule 
as shown. For example consider the reaction 


Be + dd + Bie 


leaving the final nucleus in an excited state. We suppose that the 
B?° nucleus is initially in its ground state, which from Fig. 44 has 
T =0. The reaction cannot therefore leave the nucleus in one of 
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its J =1 states, which helps one to determine experimentally 
what the 7' values of the various states are (Fig. 44). Similarly 
we have the selection rule 


AT = 0, +1; for (n, n), (, P), (P, P), (DP, n) (30.7b) 


w-meson production cross sections 
We shall now consider the differential cross sections o,, (Schiff 
1955, p. 96) for the reactions 
p+p>n-+d, (30.8) 
ptn>rtd. (30.9) 
Let the nucleons initially be in statest y and ¢. For (30.8) the 
initial wave function is 
(1) G(2) — (2)6(1) 
v2 
Under the influence of the strong interactions this wave function 
will change with time into 
YM, =a¥(p + p) + bP (at + d) (30.11) 


where the coefficients will depend on the energy and on ¥ and 4, 
for instance on whether the nucleons are really heading towards 
one another! Thus there is a certain probability |b|? of finding the 
final state m+ +d, and this gives the cross section for reaction 
(30.8). Similarly if in (30.9) we take the same ¢ and ¢ to refer to 
the proton and neutron respectively, the initial wave function is 


Fi,pn = 2-V(1)$(2) E416 2 — ¥(2)6(1)E42€-1] 
= ${¥(1)P(2) + ¥(2)P(1)]Zo,0 


+ a6) — HED fabitay 


Fi,pp = a (30.10) 


Now the state 79 +d has 7 =~ 1, so that the first term in (30.12) 
with 7’ = 0 can never change into a 7° + d state. The second term 
is related to (30.10) by isotopic spin rotations, and since the strong 
interactions are invariant under isotopic spin rotations we have, 


t The double use of ¢ as the azimuthal angle in ogg and as a wave function 
in (30 10) should cause no confusion. 


NUCLEAR PHYSICS 339 


using the fundamental theorem, that this term in the wave function 
changes with time as follows, 


1)g(2) — Y(2)A(1 
HONE) VIO gain +n) + OMe? +a 
with the same coefficients as in (30.11). Since (30.12) contains an 
extra factor 2-1/2, the probability of obtaining the products 7° + d 
is only half that for (30.11), and we have 


oop +n > 19 +d) = foy(p +p >at +d). (80.13) 


This relation holds for any particular initial geometrical arrange- 
ment and energy, and for all angles of the reaction products. 


The compound nucleus 


Suppose a light nucleus is bombarded with particles such as 
n, P, d, «, ete., with a few MeV energy. The particle may not hit the 
nucleus at all or it may “bounce off the surface” or just suffer a 
deflection due to the Coulomb force. But if the particle penetrates 
the nucleus, it immediately interacts with the nearest nucleons with 
the strong interactions of the order of 25 MeV, so that it rapidly 
loses its excess kinetic energy and becomes part of the nucleus. 
In general the probability of forming the compound nucleus is 
high only when the amount of energy available corresponds to a 
definite excited state of this nucleus, and consequently the cross 
section exhibits sharp resonance peaks around the appropriate 
energies. Now this new compound nucleus is in general quite 
unstable because of the extra nucleon(s) and kinetic energy. Never- 
theless, it takes relatively quite a long time for the compound 
nucleus to break up, because statistical fluctuations have to con- 
centrate enough of the extra energy on one particle for this to escape. 
We therefore have really a double reaction 


N; + particle > Nc > Mt + particle, (30.14) 


and can apply selection rules to each stage of the reaction. For 
instance the isotopic spin selection rules (30.7a) and (30.7b) become 
respectively 


T=To=T and Trt4=To=7T: +}. 


Angular distribution of reaction products 


Suppose for example the compound nucleus is left in a J =0 
state. Then the reaction products must come out with spherical 
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symmetry in the centre of mass co-ordinate system. Conversely 
if the reaction products emerge completely isotropically, then this 
indicates strongly that the compound nucleus state has J = 0 if 
it contains an even number of nucleons. The corresponding result 
for Jc > 0 is that the differential reaction cross section o(8, ¢) 
cannot contain terms in cos 9 higher than (cos 6)* when J is an integer, 
and (cos 6)?7-1 when J ts half an odd integer. This we shall now 
prove, and call the first angular distribution theorem. 

Let ¢om with m = My, be the initial state of the compound 
nucleus. The final wave function can be written (Schiff 1949, p. 101) 


Kom = Fmp(; (a <i). (p summed) 


representing an outgoing wave, where v is the speed of the emerging 
article relative to the final nucleus. Here ®, is the product of the 
intrinsic wave functions of the outgoing particle and of the final 
nucleus, and p indicates their spin states. The cross section is 


o(9, ) = 2Amom, (30.15) 
where Om = >.|Fmp(9, ¢)|*. (30.16) 


We have summed over p in (30.16) because we assume that the spin 
directions of the final nucleus and particle are not being measured, 
so that we want the cross section summed over all the different 
spin states p. In (30.15) adm is the probability of the compound 
nucleus being formed in the state yom. If also the initial nuclei 
and the incident beam are both unpolarized, then everything is 
symmetrical about the axis of the incident beam which we take 
as the z-axis. We have am = 4@_m, and that am is independent 
of the angle ¢. Consider now the effect of a rotational co-ordinate 
transformation 7. The jcm transform according to D,Y), 
Suppose the Ff, transform according to an undetermined repre- 
sentation Dim. Also the ®, transform according to the product 
representation DV!) x DW») where J;, Jp are the values of J for 
the final nucleus and particle. For simplicity we shall write the 
matrix in this product representation as t,,. Now by the funda- 
mental theorem above, Ym and %cm transform in the same way, 
so that we have 


TE ys = DEY = DP 859 Fine P vA 
= TF iy°P,A] 
= Ding tty on EF mp PA; 
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where A is the spherically symmetrical factor exp(ikr)/rv/?. Thus 
we have 


Danpm’p'tp’ot = D).8,0°- (30.17) 


Since rotations are unitary transformations, ¢ is & unitary matrix 
(appendix C, lemma 2 and problem C.2) and #* =#-!. Solving 
(30.17) we have 


J 
Dng,m'p? =— DQ, < 


Thus the functions F,,, transform in the same way as Yiom®,*, 
and from (30.16) om transforms like 


b* omifom(2P,*®,). (30.18) 


Now we have (using the summation convention) 
TD*D, = th stor Py*D,- 
= byt hp D,*O,- 
= O,*6,,, 


so that >®,*®, is invariant under rotations. Consequently each 
om and hence also o transforms like ¢om*¢com (m not summed), 
Le. according to 


D” x D® = 5D 
q 


where the highest value of | is 2J. Thus if o(6, 4) is expressed in 
spherical harmonics Yim(8, 4), the expansion cannot contain terms 
with | > 2J. Hence expressed in terms of cos 6, powers higher 
than (cos @)*” cannot occur. Also since yo, and o, have definite 
parities, from (30.18) o is invariant under inversion and contains 
only even powers of cos 6. Hence when J is half an odd integer, 
powers of cos @ higher than (cos @)%~1 cannot occur, which proves 
the theorem. These results furnish an important method of deter- 
mining the J value of excited states of nuclei. 


Angular momentum and parity selection rules 


From the fundamental theorem, the J value and the parity of a 
system remain unchanged during a reaction. However, this is not 
as useful as it might at first appear because in general the initial 
state does not have a definite angular momentum nor definite parity. 
Classically a parallel incident beam contains particles having all 
angular momenta from zero to infinity relative to a given target 
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nucleus, depending on how close the path of the particle is to the 
nucleus (Fig. 45). Quantum mechanically the incident beam is 
represented as a plane wave 


[8] 


etkz — ¥ (21 + 1)i491(kr)Pi(cos 6), 


jl) = (a7/22)¥/7J 141;2(2), 


(30.19) 


where the components transforming according to different re- 
presentations D™ under rotations are exhibited explicitly, and 
where ji(kr) is a spherical Bessel function. The 1=0,1,2... 
components are called the s-, p-, d-wave, etc., and they have parity 
(—l). 


eT 7 Eerie ont 


. io 
Py tT dd 


Incident beam 
Fig. 45. A beam of particles incident on a nucleus. The angular 
momentum of a given particle about the nucleus is mva, where a is 
called the impact parameter. It is the distance of closest approach 
to the nucleus if the particle followed a straight path. 


NN 


\ 


Although the incident wave contains all these components, not 
necessarily all of them are involved in a reaction. Classically the 
larger the angular momentum /h, the larger is the distance a = lh/mvu 
(Fig. 45) of closest approach to the nucleus, and if a > R where R 
is the nuclear radius, then the particle will not hit the nucleus at 
all. Quantum mechanically we have for small r 


kr)! 


ju(kr) = arent + higher powers of kr], 


so that from (30.19) the probability of a particle being close enough 
to the nucleus to interact strongly with it is small for large I and for 
small velocities (large &). For very low velocities only the s-wave 
contributes significantly to any reaction, and more generally a 
reaction is dominated by the wave with the lowest / that can con- 
tribute to the reaction due to selection rules. 
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We shall now show how a reaction has been used to establish 
some information about a particular excited state of the B1° nucleus. 
Consider the formation 


p(2:56 MeV) -+ Be® (ground state) — B10* (30.20) 


of the 8-89 MeV excited state of the compound nucleus B®, and the 
three modes of its decay 


Biot —» x + Lis**(0, +, 1; 3°58 MeV state), 
Biot _» « + Lit*(3, +, 0; 2-19 MeV state), (30.21) 
B10* _+ » + Li%1, +, 0; ground state). 


Here the asterisks are used to denote excited states, and the sets of 
quantum numbers are J, w, T. In the formation (30.20) of the 
compound nucleus, we do not know a priori which | component of 
the incident wave dominates the reaction, and therefore gain no 
information about the compound nucleus. The same almost applies 
to the outgoing a-particle wave in (30.21), but it tells us a little bit. 
Experimentally only the first of the decays (30.21) is found to occur. 
Since the first decay occurs and the «-particle has 7’ = 0, we con- 
clude that the B1* state has 7’ = 1 like the 3-58 MeV Li®** state, 
and this is consistent with the other two decay modes not occurring. 
The anisotropy of the angular distribution of the a-particles would 
then set an upper limit on the value of J. However there is an 
alternative explanation. In nuclei only the strong interactions are 
invariant under isotopic spin rotations, whereas the electromagnetic 
interactions which are about 100 times weaker are not invariant. 
Thus 7 is not an exact quantum number for designating a state, and 
it could be that the B!°* nucleus is in a predominantly T = 0 state 
with a little (~ 1%) of a T = 1 state mixed in. This would explain 
the first reaction (30.21) being observed. Now the isotopic spin 
selection rule allows the other two reactions (30.21) to occur also, and 
the fact that they are not observed must be due to an additional 
selection rule. This restricts the possible values of J and w for the 
B19 state. For suppose J >0. Then the decay to the ground 
state of lithium is allowed if the «-particle emerges in an | wave 
with 1 =J or | =J — 1 and such that (—1)! = w, and the decay 
to the 2°19 MeV state would be allowed also. However no decay 
into the 2:19 MeV and ground states is possible from a J = 0, 
w= +1 state, because this would require an 1 wave with / odd, 
which would violate the conservation of parity. We conclude there- 
fore that the B!* state may boa J =0,w=-+1, 7 =0 state 


23 
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containing an admixture of J’ =1. Actually other evidence sup- 
ports the 7 =1 alternative suggested first with a little 7 —0 
mixed in with it, and the non-occurrence of the last two reactions 
(30.21) then indicates that either the B* state has J = 0, 
w = +1 for the same reason as before, or that the admixture of 
7 = O in it is very small indeed (Malm and Inglis 1954). 


The second angular distribution theorem 


Consider a reaction which proceeds predominantly through a 
particular 1 wave in the incident beam of particles (30.19). The 
initial wave function can be written 


Pom = f(r) Yim(8, 2) $5 
where ¢, describes the particular spin state p of the incoming 
particle and of the initial nucleus. Similarly the final state is 


etkr 
oom = Fan(s #) (<5) 


where ©, gives the spin state of the final nucleus and the final 
particle. Suppose that under a rotation the various functions 
transform with the following unitary matrices: 

function:— Yim, dys Foams. &,; 

matrix:— De, - tye Discainene, ee 


Since by the fundamental theorem Y;,, and VY ,., transform in 
the same way, we have analogously to (30.17) 
Die top" Ow’ = Darmo'xm’T. aa" 
Solving this equation we obtain 
Darme'x'm's = Dinmtoe Ta”. 
Thus the functions om(9, $) = > |Fam|? transform in the same 
Ap 


way as 
Yim* Yiml 2 $,*¢,) 2 ®,*9,). 
p 


As in (30.18), each bracket is invariant under rotations, so that the 
Om transform like 


D® x DY => Dh 
where the highest value of LZ that can occur.is 21. If we choose the 


z-axis parallel to the incident beam, the reaction cross section 
o(9, d) is just og, and there is no dependence on ¢. Hence a(6) 
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expressed in powers of cos 8 can contain no term higher than (cos 8)*', 

where 1 is given by the l-wave in the incident beam through which the 

reaction proceeds. This is the second angular distribution theorem. 
Consider for instance the reaction 


Li*(1, +, 0) + «(0, +, 0) > B19(2, —, 0, excited state) 
—> Be + y, 


where the bracketed figures are the quantum numbers J, parity 
w, T respectively. If we consider only the [8 component in (30.19), 
the initial wave function transforms according to DY x DW 
x D) under rotations, so that we must have / at least 1 to obtain a 
J = 2 state in the compound nucleus. Also / = 1 gives the correct 
parity, so that the p-wave dominates the reaction. The angular 
distribution of the final y-ray will therefore have the form 


o(8) = a + 6 cos? 8. 


Of course in practice the argument is used in reverse. The observed 
angular distribution is used to obtain information about which 
l-wave dominates the reaction, and hence to limit the possible J 
value and parity of the compound nucleus. Sometimes they can be 
determined unambiguously in this way. 


Angular correlation of successive decays 


Suppose that a nucleus NV goes through two successive y-decays 
from an excited state a through the state b to c. 


Na —> No + y(Ly), 
Nyce Noch he). (30.22) 


Here L, and L, give the angular momentum #Z carried off by the 
y-ray in what is called a 2”-pole transition. If the life time of the 
state b is very short, then external influences do not have time to 
affect the nucleus in its b state, and the actual final quantum state 
of the first reaction becomes the initial state for the second. This 
leads to a relationship between the two decays. We shall choose as 
our z-axis the direction in which the first y quantum has been 
observed, and shall denote by o(8) the angular distribution of the 
second y quantum relative to the first one. If o(6) is expanded in 
powers (cos 9)” of cos 6, we shall prove that the highest power 
Vmax 18 given by 


Vmax = Min(2L,, 2L4, 2Jp or 2Jy — 1). (30.23) 
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Here 2J, applies when J», the angular momentum of the WV» state, 
is an integer, and 2J, — 1 applies when J» is half an odd integer. 

We first note that the forms of the matrix elements governing 
the absorption and the emission of a quantum are the same, so that 
all selection rules are the same, and for our purposes we may replace 
the reaction (30.22) by 


Na + (Ly) > No > Ne + y(Le). (30.240) 


Similarly the miscroscopic probability of a reaction going one way 
is the same as for the opposite way, and we may consider 


Secondly, a photon state y(Z) has angular momentum AL and trans- 
forms according to D') (problem 30.3). We may therefore treat 
it as a spinless particle in an orbital L-wave. We can now apply 
the first and second angular distribution theorems to each of the 
reactions (30.24), and the result (30.23) follows immediately. 


Nucleon m-meson scattering 
We shall now calculate the relative cross sections of the reactions 


at +p >ont+p, (30.254) 
am +p>7 +n, (30.25b) 
aw +p>a +p. (30.25c) 
Consider first (30.25a). The initial wave function is 
Pia = ya(orb, spin)rté, = W1F5/2,9/2 (30.26) 


where y¥ depends on the orbital and spin variables ry, ozp, r, of 
the initial particles and represents the meson beam incident on the 
protons. 5/5,5/. is a pure isotopic spin function with JT = 3, 
since the 7™ and the é, transform according to D® and D(/2) 
under isotopic spin rotations respectively (Table 35) and My has 
to be 3/2 because we have a proton (Mr = 3) and w+ meson (U7 
= 1). The initial function (30.26) turns under the effect of the 
strong interactions into the final state 


Pra = pe(orb, spin)Es/2,5/2 = or wWtE,. (30.27) 


which again can only consist of a w+ meson and a proton. Here 
ye represents the outgoing beam (including any unscattered part) 
and has a certain angular dependence which determines o(8, ¢). 


NUCLEAR PHYSICS 347 


If we now assume that all three reactions (30.25) are being carried 
out at the same energy under identical geometrical conditions, the 
initial state for reactions (30.25b), (30.25c) is 


Fro = ya(orb, spin)7é, 
= palr/ (4) 23/2,-1/2 + AVA ($)Ey2,-1/2)) (30.28) 


where we have expressed the product w-€, in terms of T = 3 
and J' = } functions using the Wigner coefficients in the manner 
of (9.8). The numerical values of the coefficients have been taken 
from appendix I. The reason for writing (30.28) in this way is 
that the two terms in it change with time under the influence of 
the strong interactions quite independently. In fact we shall assume 
that the T =} component is affected only so slightly by the 
interactions that the final wave function is the same as the initial 
function, so that this component of the meson beam goes straight 
on without any appreciable scattering. We shall justify this assump- 
tion later. However we shall suppose that the 7’ = 3 term in 
(30.28) does change with time due to the interactions, so that it 
does give scattering. In fact it is related to (30.26) by isotopic 
spin rotations and changes finally into +/(4)¥t53/2,-172 analogously 
to (30.27). The final wave function for reactions (30.25b), (30.25c) 
is therefore 


Pry = Vv ($)}1Z3/2,-1/2 = JV (&) 151 /2,-1/2 
= VV (dbl Fr _ + (4) E] 
+ (the unscattered 7’ = } term), (30.29) 


where we have used the Wigner coefficients, this time in the manner 
of (9.7) to split 2,5 4/2 up again. The important point is that the 
orbital and spin wave function y¥; in (30.29) is the same as in (30.27) 
so that the cross sections for the three reactions are proportional 
to one another. Moreover the probabilities of forming the states 
a- +, 7° +n and 7 + p are proportional to the squares of the 
amplitude coefficients in (30.27) and (30.29). Hence the cross 
sections for the three reactions (30.25) are in the ratio 1 : 2/9 : 1/9, 
ie. 9:2:1. This is in agreement with experiment over the range 0 
to 300 MeV of meson energies, which justifies the assumption that 
the T' = } term leads to only a small amount of scattering in this 
range. The reason for this can be understood vaguely by thinking 
of the compound nucleus picture. If the proton and 7-meson formed 
a definite “compound nucleus” in a defined excited state, this would 
have a particular 7 value (7 = % in our case) and so would the 
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final reaction products. Now the proton and w-meson do not form a 
relatively stable compound nucleus state, as evidenced by the 
fact that the scattering cross section does not exhibit a sharp 
resonance peak at some energy. However, we can still think of this 
tendency towards forming a 7’ = $ compound nucleus as persisting 
and dominating the scattering in our energy range. For energies 
of 800-1200 MeV the 7 =} scattering predominates over the 
T = $ and in the range 300-800 MeV the two processes are com- 
parable (Gell-Mann and Watson 1954). 
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Summary 

If in a reaction process the interaction Hamiltonian is invariant 
under some group of transformations, then the initial and final 
states in the reaction have the same transformation properties 
with respect to the group of transformations. This has been used 
for isotopic spin rotations, spatial rotations, and space inversion. 
Some miscellaneous examples have illustrated its application to 
selection rules for reactions, the angular distribution of the reaction 
products, and the ratios of some related cross sections. 


PROBLEMS 


30.1 # is invariant under some group of transformations, and 
#,(F, t) is a set of wave functions at ¢ = é, transforming according 
to the representation D. Using the time-dependent Schrédinger 
equation, show that 2,,/0f, 0°y,,/0t? and all higher time derivatives 
at t =t, also transform according to D. Hence using a Taylor 
expansion for %,(r, #), show that the %, transform according to 
D at all times f. 

30.2 # is invariant under the space inversion J7. Hence show 
that 14’ = #II and d<IDdt = 0, where <JID is the expectation 
value of J7 in a certain state. Hence show that a state with even 
(odd) parity at one time continues to have even (odd) parity for 
alltime. Generalize this argument to give a proof of the fundamental 
theorem of this section (see problem 17.3). 

30.3 Show that Maxwell’s equations have linearly independent 
solutions represented by the vector potentials 


Aju = O,k-* curl Ippuzy, 


M e 
Aru = tC Nomttm, 
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where Uzyg = jr(kr) Ypa,(8, $) exp (tkct), and 
Tory = —tr A grad 
is the vector operator (8.25). Show that 
PALa = (2? + Ty? + [2)ALy = L(L + 1)Azy, (30.30) 


and that the AZ transform according to (8.18). Here I is the total 
infinitestimal rotation operator. 


I= Torn a Teomp- 


It includes the change I,,,,, in the co-ordinate axes to which the 
components of the vector A are referred, as well as the change 
I,,p in the orbital variables. (See § 32 for a somewhat more detailed 
description of the difference between Ig, and Iygmp-) Also show 


Keomp“Lu = S8(8 + Atty = 2ALu- (30.31) 


The fields AZ,,, AM, are called pure electric and magnetic multipole 
fields, and basic photon states can be defined in terms of them. 
(30.30) shows that the total angular momentum is 4/[Z(L + 1)]h, 
and (30.31) that in some sense there is a spin angular momentum 
of one unit. However, the separation of the total angular momentum 
into orbital and spin is not as definite as in the case of an electron, 
for the Aj are not eigenfunctions of I, comp, Jz orp, nor I?» 
(Siegbahn 1955, p. 376). 

30.4 Neglecting the mass difference between protons and 
neutrons, show that the probability of a nucleus making an electric 
dipole y-ray transition involves the operator 


2 (tae + $)Pe = > TzkT kz, 


where rz is the #8 nucleon co-ordinate relative to the centre of 
mass. Hence, show that the 5-1] MeV excited state of B® with 
J = 2, w= —1, T =0 cannot make an electric dipole transition 
to the ground state J = 3, w = +1, 7 =0. 

30.5 What would the ratio of the cross sections be for the 
reactions (30.25) if only the 7 = } state contributed to the scat- 
tering? What is the ratio of the cross sections for p + d—at 
+ H* and p + d 7° + He® assuming charge-independent forces? 

30.6* Discuss the assignment of quantum numbers to the 5-11 
and 5:16 MeV excited states of B1°, particularly with reference to 
the reaction of Li®(ay)B1° (Jones and Wilkinson 1953, Ajzenberg 
and Lauritson 1955, Radicati 1953). 
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30.7* Set up the selection rules governing the emission of a 
y-ray (Siegbahn 1955, p. 373; Gell-Mann and Telegdi 1953). 

30.8* Find and discuss some examples in Ajzenberg and Laurit- 
son (1955) of selection rules being used to determine the quantum 
numbers of excited states of nuclei. 

30.9* A nucleus in an excited state undergoes two successive 
y-decays. Outline the detailed theory of the angular correlation 
between the y-rays, discussing in particular the importance of 
symmetry properties (Siegbahn 1955, p. 531). 


30.10 A,, Ap and A_ are the amplitudes for the reactions 
at +9 > D+ + Kt, 
a +p > 2° + Ke, 
a +p>2 4+ Kt, 


Using the isotopic spin assignments of Table 35 (p. 332) and the 
assumption of charge independent forces, show 


4/2Ag = A, — AL 
and hence that the differential cross-sections satisfy 
[2c0]* < [o,]* + [o_]* 


at all angles of scattering. 


Chapter VITI 
RELATIVISTIC QUANTUM MECHANICS 


31. The Representations of the Lorentz Group 


Proper Lorentz transformations 


If a flash of light is emitted at the origin of co-ordinate at time 
t = 0, the light wave will travel radially outwards in all directions 
with a uniform velocity c. The equation of the wave front is 


x? + y? + 22 — c%f? = 0. (31.1) 


Let us now consider an observer moving with uniform velocity 
v relative to the co-ordinate system Ox, Oy, Oz, and let the origin 
of his new co-ordinate system 2’, y’, 2’ be chosen to coincide with 
O at time t’ == t = 0. Now according to the principle of the special 
theory of relativity, the velocity of light is independent of the 
motion of the observer, so that an observer at rest in the second 
system of co-ordinates also sees a spherical wave front 


a2 4. yh + 2/2 — oy’? — 0, (31.2) 


Thus from (31.1) and (31.2) a transformation of co-ordinates from 
2, y, 2, t to 2’, y’, 2’, t’ moving with relative velocity v, leaves the 
quantity x? + y? + z? — c%? invariant. Such a transformation is 
called a Lorentz transformation, in particular a proper Lorentz 
transformation if it is achievable physically and does not involve 
an inversion of the space or time co-ordinates. If we put 7’ = tet, 
the invariant quantity becomes 2? +- y? + 2? + T?. Now the charac- 
teristic feature of a rotation in three-dimensional space is that it 
leaves x? +. y2 +z? invariant (cf. equation (3.4); problem. 3.7). 
Thus by analogy we can regard the proper Lorentz transformations 
as rotations in the four-dimensional x, y, z, T space. 

For instance the transformation L(v,z) to a system 2’, y’, 2’, T 
moving with velocity v along the Oz axis ist 


f 


_ _@ +2’) _ _t — ve']e™ 
7a — yey VT whl 


+ This can be proved by a more detailed consideration of (31.1) and (31.2). 
See any text discussing the special theory of relativity, e.g. Goldstein 1950, 
p. 188; Stratton 1941, p. 76. 
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and this can be written in the form of a rotation in the 2T'-plane 
by an angle 6. 


ee ee, :: oe 
Vila wh Vey ee ee 


31.3 
Be’ rT oe) 


a oa ae —_—_—__— = 2’ gin @ n 
T Vas) + Vas aon + T’ cos 6 


tand=i8 or tanh (ié) = —f 
where 8 = v/c. (31.4) 


Thus by analogy with (8.3) we can write 


L(v, 2) = exp(i6Iz7) (31.5) 


in terms of an infinitesimal rotation operator Iz7 in the 2T-plane. 
Here 6 is restricted to be an imaginary number as shown by 
(31.4) so that the transformation is completely real when expressed 
in terms of 2, y, z and ¢. Clearly the ordinary rotations in z, Y, 2 
space (which we shall call pure rotations) can be included as a 
special kind of Lorentz transformation, and the infinitesimal rota- 
tion operators I,, Iy, Iz of § 8 become Iyz, Izz, I. zy in the notation 
of (31.5), since a rotation about the z-axis is a rotation in the yz- 
plane. If the two equations (31.3) are written in reverse order, 
they can be interpreted as a transformation exp(—i6I7z), so that 
In; = —I,7 and in general 


Iw = —Ly- (31.6) 


Thus symbols such as Jz, which have not been defined so far must 
be considered as zero. We therefore obtain six independent 
infinitesimal Lorentz transformations Izy, Iyz, Izz, Izr, Iyr, Ler. 

It is not difficult to prove that the product of two Lorentz trans- 
formations is also a Lorentz transformation. In this way by com- 
bining the simple transformation we have mentioned, we can build 
up a whole group of transformations. This is called the proper 
Lorentz group 1 and contains as a subgroup all proper rotations. 
However it does not contain the time inversion ¢ = —?’ nor the 
space inversion JJ since these cannot be represented purely in terms 
of rotations about the x, y, z, T' axes. 


Commutation relations 


All the commutation relations between the above six infinitesimal 
rotation operators about the z, y, z, 7’ axes can be deduced directly 
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from the commutation relations between the ordinary infinitesimal 
rotation operators of § 8. For consider the rotation R(«, x) = 
exp(tal yz), 

c= 2! 

y=y' cosa — 2’ sina (31.7) 

z2=y' sna +2’ cosa 
and two other rotations R(B, y) = exp(iBlzz) and Rly, z) = 
exp(tyIzy). The relationship between these three rotations is 
completely determined by the three sets of equations analogous to 
(31.7). In particular in § 8 we derived the commutation relation 

Teylyz — Iylday = U2 (31.8) 


from these equations by a procedure which effectively consists of 

writing down the relation between R(a, x), R(B, y), R(y, 2) when 

a, B, y have some special simple values. We can now replace z by 

T in (31.7) and throughout the argument and obtain from (31.8) 
Leylyt — Lyrl ay ~~ tz = —tlyr. 

in general 


Lyldvp — Ll = —tI,, — (suffices not summed). . 
(31.9a) 


Transformations such as exp(iaIzy) and exp(i@Iz7) commute since 
they alter only the x, y and z, 7 co-ordinates respectively, whence 
from (31.5) and (8.3) 


TyL pq — Tpel yy = 9 (tt, », p, @ all different). | (31.9b) 


Any desired commutation relation can now be derived from (31.9a), 
(31.9b) or a trivial relation such as 


Lyl yy ~ Luly = 9  (notsummed). (31.9) 


Irreducible representations 


Instead of using the above infinitesimal Lorentz transformations, 
it is more convenient to define the linear combinations 


A'y = $(Lyz ae Izr), A’y = 4 (Lex + Ly), A", = 4(Lay + Iz), 
A’, =A’, + iA’y, A'_=A’',;—iA’y 
A,=—A'., A.=—A',, Az=—A’% (31.10) 
Bz = $(Lyz a Iz), By = 4(Lez = lyr), B, = (Iey oa Ir), 
B, = B; + iBy, —— B,z = +By. 
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Now it can easily be verified from (31.9) that each A and A’ operator 
commutes with each B operator, e.g. 


4(A’,By ~ A’yBz) 
= (Iyzelzx a, LzzI yz) + (Ilerlzz = Izzlz7) 


— Lylyr — Iyrlyz) — Terlyr — Iyrlzr) 
= —tlyz pacar! tL 2 == Ler —_ tly = 0. 


Also each set A,, A_, Az; A’,, A’_, A’z; B,, B_, Bz satisfies the 
same commutation relations (8.10) as the infinitesimal rotation 
operators J,, I_, Iz 


In, o— II, = bare 
em _ id, = —I_, (31.11) 
0 Gee Ge faa) 


We can now use the operators A,, A_, dz and B,, B_, B, satisfying 
the same relations (31.11) to deduce the irreducible representations 
of the Lorentz group, in the same way as the infinitesimal rotation 
operators were used in §8 with respect to the rotation group. 
Consider a given vector space which is invariant under the Lorentz 
group. It is then also invariant under the infinitesimal operators 
(31.10). We first reduce the space with respect to the commuting 
operators A,.and B,, and from among the eigenvectors belonging 
to the highest eigenvalue j of A, we pick out the one with the highest 
eigenvalue j’ of Bz. As in § 8, we can construct from this vector a 
total of (27 + 1)(2j’ + 1) standard base vectors umm’ such that 
(cf. (8.18)) 


A Umm = V(I9 + 1)—m(m + 1)) Uma,’ 
A_Umm’ = V{9H(9J + 1)—m(m — 1)) Urn’ 


Agimm' = MUmm' 


(31.12) 


Bytinn' = VtI'(9 + 1U)—m'(m' + 1)) Umm’ 
B_uUmm' = VJ 19'(7' + 1)—m'(m' _ 1)j Um,m’—1 
Baan! =m Unn' 


As in § 8, these vectors form the basis of an irreducible representa- 
tion of the proper Lorentz group which we shall denote by DW), 
where j, j’ are integers or half odd integers. Any finite representa- 
tion is reducible into a sum of these. 
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Examples 


A little manipulation shows that the quantities x, y, z, t transform 
according to the representation D‘+*), In fact the standard base 
vectors are 


Uy, =e2-e=2z—17, U, =“ — ly, 
U4. =x-+ ty, “4 =—z+¢d = —z2—10T, 
(31.13) 


where for simplicity we have written u,, for Uiije sie From 
(8.11) the infinitesimal Lorentz transformations can be expressed 
as differential operators 


é é f @ 7) 
Ixy = ~i(eF — =) Lit = ~ife xn —_ rs), ete. 
| (31.14) 
whence 


Aw,, = (—Iny — Ier)(z — 1T) = He — tT) = du, 


Similarly it may be verified that each of the vectors (31.13) satisfies 
(31.12). 

If Uyo, U_yo (abbreviated to u,o, u_o) transform as standard base 
vectors (31.12) according to D®), and u,, we_ according to 
D©®D, we can construct the set of base vectors 


a = Ce 
[G+ mG — mi8G + mig’ — mye? 


(31.15) 


These transform as standard base vectors according to D@®. This 
can easily be verified, for the infinitesimal operators being defined 
in (8.1) in the same way as a differential operator like d/dz, operate 
on products in a similar way (cf. problems 8.10, 8.11), e.g. 


1 uo, etc. 


A,ut,g=nuyy Aiuto =0, A_utg=nuty 
Quantities transforming like w,o, w_o or Uo,, Uo_ are called spinors, 
though this term is often also applied more generally to quantities 
of the type (31.15). 

Also the matrices representing the infinitesimal Lorentz transfor- 
mations can be written down from (31.12). Hence the matrix 
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representing any desired Lorentz transformation can be calculated 
using the method applied to D/2) in § 8. 


The complete Lorentz group 


The proper Lorentz group I considered so far can be augmented 
by combining each proper Lorentz transformation with the space 
inversion J], the time inversiont 7, and the space-time inversion 
Ir = rIT 


(31.16) 


Such transformations are called improper Lorentz transformations» 
and together with the proper ones they form the complete Lorentz 
group §. 

Let LZ be any proper Lorentz transformation. Then Z can be 
made up as a product of simple transformations exp(t@I,,,) which 
are pure rotations or simple Lorentz transformations along one of 
the co-ordinate axes. In § 8 (equation (8.22)) this was shown for 
any rotation. In the present case we can rotate the system such 
that the relative velocity is along the z-axis, apply L(v, z), and then 
rotate again to any desired orientation. Now if we reduce each of 
the angles @ in these transformations to zero, L changes continuously 
into the identity transformation HZ. This cannot be done for any 
of the transformations (31.16) since these are inversions which in 
some sense turn our co-ordinate system “inside-out”. However, 
IIL or LIT can be reduced continuously to JT. In this way the 
complete Lorentz group is seen to contain four branches |, J/I, 
ai, Iz! consisting of all transformations that can be reduced con- 
tinuously to H#, IZ, +, [Ir respectively. Any transformation from 
tl combined with any transformation from IT! gives one belonging 
to lvl, etc. Thus, since 7? = +r? = (ITr)? = EH, the four branches 
[+ J! + 71 + rl together form a group, the complete Lorentz 
group §, as stated above. Similarly I and J/I form the space-inver- 
sion Lorentz group, and [+ cl and I + JI7I also form groups. 
However, [ + I7{ + J771 does not form a group because it contains 
the elements JJ and + but not their product [77. In the following 


+ It should be noted that 7 is the simple time-inversion operator and not 
the usual time-reversal operator of quantum mechanics defined in § 19. 
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we shall only concern ourselves with the complete Lorentz group 
£, the restriction to the various subgroups not being difficult. 


Representations derived from D@» 

Consider first the group #, II, +, II7, where I]? = 7* = (IT)? 
= H, II(ITr) = 7, ete. This group is Abelian and isomorphic with 
the point group 222. Its only single-valued irreducible representa- 
tions are the four one-dimensional ones shown in Table 36, as can 
easily be shown from the relations of § 14 or by the methods of §7 
(problem 7.4). 


TaBLE 36 
Single-valued Irreducible Representations of HZ, IT, 7, [Ir 


x(#) x(7Z) x(t) x(1Tt) 


2 
» 
a) 
_ 
| 
wes, 
| 
— 


We have already shown that x, y, z, T or, in standard form, the 
linear combinations (31.13) transform under the proper Lorentz 
group according to the irreducible representation D@), Now these 
base vectors are also transformed into one another by the operations 
IT, +, Ir (31.16), and therefore form the basis of a representation 
of the complete Lorentz group which we shall call D@+, For 
instance the matrix representing JT with respect to the standard 
base vectors (31.13) is 


4 

] 
Now from this representation we can obtain three other representa- 
tions Dit), r = 1, 2,3 as follows. We simply multiply the matrices 
of all transformations 7’ belonging to the branches [, JI, zl, IT7l 


by xr(#) = 1, xr(IZ), xr(7), xr(II7) respectively, where the x, are a 
row of characters from Table 36. Thus 


Darix( 7) — xr(a) DVO('T) (31.17) 
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where a = H, II, 7, Iv as T belongs to [, JI, zl, vl. This really 
does give representations. For suppose 7, 7, belong to the 
branches J7I, zl: then 7,7, belongs to I77I, and from Table 36 


Xr( IT) D440) ( 7 )yp(7) D440 (T,) = x-(IT7) D460 (7, T,) 


as required. It is shown in appendix H that the four representations 
DG) r=0, 1, 2, 3 are inequivalent and the only ones which 
reduce to D“? for proper Lorentz transformations, so that we have 
exhausted all the possibilities in this direction. Components trans- 
forming like x, y, z, TJ’ according to the representation D(##0 
are said to form a regular four-vector; and if according to D(tt), 
r =], 2, 3, then a pseudo-vector of type r (Watanabe 1951). 


The representation D(‘+0+04) 

Let to, U-o and uo,, w%— transform respectively according to 
D®) and D‘%) under the proper Lorentz group. We shall now 
obtain from them a representation of the complete Lorentz group. 
From (31.15) the vectors 


UzoUo+, Unto+, U+oUo—, U-oU%o— (31.18) 


transform under the proper Lorentz group according to D(ti- 
Let us also suppose that wo, uo, Uo4, Uo— are so related that the 
vectors (31.18) are also transformed into one another by J7 and +. 
They therefore transform under the complete Lorentz group accord- 
ing to one of the representations D‘#”), and let us suppose for the 
present that its is D+. Then as far as their transformation 
properties are concerned, the vectors (31.18) may be identified with 
the vectors (31.13). From this identification we get 


T U_qto+ = U_gttais T U_guo— = —Uyz gto: (31.19) 
From these relations it can be seen that if 7+ induces a linear trans- 
formation among the 1,5, U_9, %o4, Up_, then we must have 
Tibyg = Ag, TH, = —(I ayo, 
TUu_g = —AUg,, TUg_ = (l/a)u,o, (31.20a) 


where a is a numerical constant. In order to obtain a particular 
representation we shall put a = 1 and thus 


Ths9 = Uo_» TUgy = —UL_o, 
TU_y = Ug4, Tug == U9. (31.20b) 
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Similarly we can consider the effect of H, and from (31.13) and 
(31.18) we obtain 


TT qlo, = U_-or%o-» TTu, Qo = —%+040— 
TTu_gto, = —U_oMo+ TTu_gltp_ = Uso%o+ (31.21) 
and analogously tof (31.20b) 
TTu4,5 = ~—My_; TTug, = U9, 


Thus the vector space (us, U_o, Uor, Uo_) 18 invariant under the com- 
plete Lorentz group, and forms the basis of a representation which 
we shall denote by D‘t0+0t4), 

Furthermore the representation D(#o+04) is irreducible. For 
suppose it were reducible: then considering the proper Lorentz 
transformations it could only be reducible into D@® +- D% but, 
from (31.20), (31.22), the subspaces (u,9, %) and (uo,, %_) are not 
invariant under JJ and 7. It can also be shown now that the different 
representations obtained by choosing different values of a in (31.19) 
are all equivalent. They can all be obtained from our particular 
one (a = 1) by a continuous change of the parameter a. By con- 
tinuity the representations cannot suddenly change from being 
equivalent to being inequivalent, so that they are all equivalent, 
(unless there were an infinite number of inequivalent representations 
of dimension 4). 

Above we deduced a total of four representations Dt), r 
=, 1, 2, 3 from the representation Dt by introducing the 
factors yr (Table 36) into the matrices of the various branches 
(equation (31.17)). We can now do the same thing to Dih0+ 0434) | 
However the four representations turn out to be all the same and 
just D(t0+08:! For the pairs uo, u_» and u9,, Ug both transform 
under pure rotations according to the representation D‘/®) of the 
rotation group (problem 31.3), so that like D\/® the representations 
D0 and Dt) of | are double valued. Each element T of the 
group § is already represented in D‘+0t:) by two matrices +D(T) 
and —D(T), so that introducing the various changes of sign of 
Table 36 does not give new representations. However, because of the 
double-valuedness we can obtain some other representations by 
using the double-valued xy of Table 37 instead of the single-valued 
ones of Table 36. Thus we get the four representations D(#0+0t:), 
r = 4, 5, 6, 7. 

t The constants in (31.20) and (31.22) are not completely independent 


because JTt has to be represented by a multiple of the unit matrix ( appendix 
Hi). 
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TaBLE 37 
Double-valued Irreducible Representations of E, I, 7, [+r 


x(Z) x( 17) x(t) x(ITr) 


Xe +1 +1 +1 +1 
Xs #1 +1 + +t 
Xe +1 +4 +1 +4 
Xa +1 +1 +4 +1 


This brings up a point which has some important physical con- 
sequences. The four representations D(#t7), r — 0, 1, 2, 3 are all 
single-valued and differ by changes in sign which certainly are often 
important. However the D‘i0+0iir), , — 4, 5, 6,7 are double-valued 
with no distinction between transformations differing in sign. 
This suggests that for physical applications at least, factors of -L¢ 
may also be unimportant. I.e. there may be no point in distinguish- 
ing between the various D‘#0+04) since these differ only by factors 
of + in some of the branches of £. This is definitely known to be the 
case when dealing with a single spinor field like in the theory of 
electrons, positrons and electromagnetic radiation: it makes no 
difference to any of the physically meaningful results which re- 
presentation D‘t0+0t:r) one uses to describe the electron-positron 
fieldt (Jauch and Rohrlich 1955; see also discussion below equation 
(11.23a)). From now on therefore we shall adopt the representation 
with r = 6 as our standard one because it gives the various factors 
+ in the most convenient places, but we shall usually refer to it 
simply as D‘#0+0t) go as not to imply any significant distinction 
from the other D(t0+08;r), — 

For the usual quantum mechanical applications it is convenient 
to introduce in the vector space (t+, U_9, U9,, Uo_) @ new set of base 
vectors 


(31.23) 


} However it may be physically meaningful to distinguish between the 
Dit? + 047), r = 4, 5, 6, 7 in a theory involving more than one type of spinor 
field, e.g. in a theory of electrons, nucleons, neutrinos, 4 mesons and their 
antiparticles all in interaction with one another (Yang and Tiomno 1950). 
We shall not consider such a situation. 
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Adopting D‘t0+0t:6) now instead of D(#+08®) in accordance with 
our convention, we introduce a factor i in (31.22) and get 


lu, =4u,, JIIu=u_, Mv, = —»,, Z (31.24) 


TU, =—U,, UW=-—U_, TWH=Y%,, . | (31.25) 


It can easily be shown that the pairs u,, u_ and v,, v_ each transform 
under pure rotations as standard base vectors according to the representa- 
tion D“/2) of the rotation group. For instance I, of § 8 is Izy in the 
present notation, and we have 


Tuy, = Igyu, = (Be — Az)(+/$)(ug, + uo) from (31.10), (31.23), 
= ju, from (31.12), (31.23). 


In this way the relations (8.18) with 7 = $,m = +4 may be verified 
for each of the pairs u,, u_ and v,, v_. For the sake of future 
reference, we also note that the matrices representing Izy, Iyz 
and I,7 with respect to u,, u_, v,, v_ are $M, 4N and 4K respec- 
tively, where 

| . 1 


ms a 
<1 ee ae (31.26) 


Other representations 


The following is a brief description of all the finite irreducible 
representations of the complete Lorentz group £. They are derived 
and described in detail in appendix H. Consider the base vectors 
tmm’ transforming according to D”) under I. Then the vectors 
Tumm' transform according to D9’), Thus when j 47’, to get a 
representation of £ we have to take the Umm’ and rumm’ together 
to form a representation D‘#’+d’i+) of dimension 2(2j + 1)(2j’ + 1). 
When j is an integer and j’ half an odd integer or vice versa, the 
representation is double-valued. In this case r takes the values 
4, 5, 6 or 7, giving representations related to one another in the 
same way as the D(t0+0t:r) by (31.17) and Table 37. When j, j’ 
are both integers or both half odd integers, the representations 
D‘si'+'5) are single-valued and r takes the values 0 and 1. The 


362 GROUP THEORY IN QUANTUM MECHANICS 


representations with r = 2, 3 can also be constructed from (31.17) 
but are equivalent to those with r= 1, 0 respectively. When 
j =Jj', we obtain four representations of £, i.e. DY"), r = 0, 1, 2, 3. 
These have dimension (2j + 1)? and are directly analogous to the 
D‘t"), In all cases products of regular vectors are taken to trans- 
form according to representations with r = 0. 
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For further details about the proper and complete Lorentz groups 
the reader is referred to appendix H and to the following: Van der 
Waerden 1932; Boerner 1955; Watanabe 1951; Jauch and Rohrlich 
1955, pp. 82 ff. (time-reversal); Murnaghan 1938; Harish-Chandra 
1947; Wigner 1939; Heine 1957a. 


Summary 


Lorentz transformations can be regarded as rotations in the 
four-dimensional space 2, y, z, J’ = ict. This leads to the irreducible 
representations DJ" of the proper Lorentz group in the same way 
as the representations of the rotation group were determined in 
§8. D9" has dimension (27 + 1)(27’ + 1) and a standard set of 
base vectors umm’ transforms according to (31.12). In particular 
x, y, z, # transform according to D‘#®) under proper Lorentz trans- 
formations, the linear combinations (31.13) being standard base 
vectors. The complete Lorentz group £ includes the space and time 
inversions. There are four types of regular and pseudo-vectors 
corresponding to the representations Dt), r—0, 1, 2, 3 of §&. 
Thus 2, y, z, t (or ict) form a regular vector transforming according 
to D{%#, The spinors u,, u_, v,, v_ used in an electron-positron 
wave function may be taken to transform according to any one of 
the representations D(40+0bir), » — 4, 5, 6, 7 so that we shall write 
just D‘#0+0d), All the other irreducible representations have also 
been described. 


PROBLEMS 


31.1 Show that the combination of two Lorentz transforma- 
tions is itself a Lorentz transformation. 

31.2 Show that exp(i0,Jzr) x exp(t0,Jz7) = exp[t(6, + 4,)Iz7], 
and hence deduce the relativistic law of composition of velocities 
v = (Vv, + U2)(1 + 0,0,/c°). 

31.3 Show that I,, = 4(—A, — A. +B, + B_) and hence 
calculate in the manner of § 8 the matrix representing R(@, x) in 
the representation D‘® in standard form. 
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31.4 Using the method of problem 8.16, show that if Umm’ 
transforms as a standard base vector according to Di", then 
U*mm' transforms like um'm according to DY’). Hence show from 
(31.23) that u,*, w_*, v,*, v_* transform in the same way as v_, 
—v,, —U_, U,. 

31.5 In the manner of § 9, show that 

DGD x Db = Nav 4+ Ya 4 POV +4 P00), 


According to what representation do the six components of the 
angular momentum transform? 

31.6 Show that the representation Ds" is single-valued if j 
and j’ are either both integers or both half odd integers. Hint: 
use the reduction of product representations such as in the previous 
problem. 

81.7 Write down some physical examples of the various regular 
and pseudo scalars, vectors and tensors. 

31.8 Show directly that w,o, u_o transforming according to 
D‘t0) under proper Lorentz transformations, cannot possibly form 
the basis for a representation of the complete Lorentz group. Hint: 
#49, U_9 transform according to D') under pure rotations: Hence 
from Schur’s lemma, JT must be represented by a multiple of the 
unit matrix. Also JTL(v) = L(—v)II, but this leads to a contra- 
diction since L(v) and L(—v) are not represented by the same 
matrix. 

31.9 Show that a set of vectors transforming according to 
D‘4i" under the proper Lorentz group transform under pure rotations 
according to the representation DY) x D‘i” of the rotation group. 
Hence show that DJ”) is double-valued if 7 is an integer and j’ 
half an odd integer. 


32. The Dirac Equation 


Relativistic wave equations 

To set up a relativistic form of quantum mechanics to describe 
electrons or any other kinds of particles, we proceed as follows in 
complete analogy to the usual non-relativistic form. In relativity, 
the momenta pz, py, pz are the first three components of a four- 
vector p, whose fourth component is pr = if x/c where Hx is the 
kinetic energy.t The square of the length of this four-vector is a 


+ In accordance with the previous section, we take 2, y, z, tct = T as the 
four co-ordinates. We shall use » etc. to denote z, y, z or T, e.g. py, and 8/dp. 
The suffix j will be used to denote z, y or z (not 7), and bold type an ordinary 
three-vector: thus p? = p43. 
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constant — mc? (Goldstein 1950, p. 203). 

PuPn = —m*c? (32.14) 
or Ex? = p’c? + mic, (32.1b) 


In quantum mechanics we have to interpret the p, as operators, 
and corresponding to (32.1a) we have the relativistic wave equation 


(PuPy + mc?) = 0. (32.2a) 


The quantity p,,p, is a world scalar, so that (32.2) is invariant under 
all proper Lorentz transformations. As usual, we adopt the 
Schrédinger representation and put 


a 
Py = —ih Ba (32.3) 


so that (32.2) becomes the differential equation 


a a ae 1 @ 
mS + By? + aa —-3 aa) + mcs = 0, (32.2b) 
to replace the usual non-relativistic Schrodinger equation. If now 
we put % = g(r, #) and regard % as a wave function describing the 
state of one particle, we would appear to have a satisfactory, relativis- 
tically invariant theory. 

For one particle this is more or less true, but a dilemma immediately 
appears when we try to discuss more than one particle this way. 
For two particles, for example, we would need six spatial co- 
ordinates r, and r,, and to preserve the fundamental relativistic 
symmetry between space and time, this would automatically 
demand the introduction of two time co-ordinates ¢, and f. Now 
physically in any one frame of reference, time is something which 
is the same for all particles, and no physically acceptable and 
useful way has yet been discovered of developing a theory with 
more than one independent time co-ordinate. However, a way 
out of the difficulty has been found by developing the quantum 
theory of fields, and it is easy to see in outline how, by considering 
the electromagnetic interaction energy between two moving charges 
e, and e,. The Coulomb energy is ¢,e,/7,., and the magnetic energy 
can also be expressed in terms of r,, rz and the momenta pj, Pp. 
This gives the above type of description which depends explicitly 
on the number of particles, and which cannot be made truly relativis- 
tic because of the single time co-ordinate. However, if we describe 
the system in terms of the electric and magnetic fields & and H, 


RELATIVISTIC QUANTUM MECHANICS 365 


the total energy becomes ${(&? + H?) dv, quite independent of the 
number of particles. Moreover, & and H depend only on the one 
set of variables r, ¢. We shall not follow this field approach any 
further here because it would rapidly lead us outside the scope of 
this book, but merely mention two facts that arise out of it. Firstly, 
& system with any number of particles can be described in terms of 
a quantum field operator Y(r, t) which, for freely moving particles, 
satisfies the equation (32.2) already derived, in this case called the 
field equation. Secondly, if the system contains only one particle, 
its motion can be described to a good approximation by (32.2) 
considered now as a wave equation for a one-particle wave function 
(r,t). We shall therefore discuss (32.2) as a wave equation, remem- 
bering that this is only an approximation as a one-particle wave 
equation, but that at the same time it has a much wider applicability 
as a field equation. It makes no difference to the symmetry pro- 
perties, of course, which interpretation we have in mind. 

In (32.2) the wave function ¥ (or field Y) may be a multi-compo- 
nent function with components ,(r, t). These can be the spin com- 
ponents of a spinor ¢ as in § 11, or the components of a relativistic 
vector or tensor function (like the components &,, &y, &z of the 
electric field @ in three dimensions). With such multi-component 
functions, a Lorentz transformation has a two-fold effect. It first 
transforms the variables r, ¢ to new ones r’, t’, and at the same time 
it transforms the components ¥, among one another. Here r’, ¢’ 
are just the co-ordinates in the new reference frame, of the same 
point P whose co-ordinates are r, ¢ in the old frame. This part of 
the transformation is not very interesting, so that we shall ignore 
it in what follows and just write 4,(P) or ¥, without specifying which 
co-ordinates r, ¢ or r’, t’ are being used for P. However, the trans- 
formation of the different components %, is more interesting. We 
note that it has to belong to one of the representations of the 
Lorentz group, because every transformation must preserve the 
type of function »% that we are dealing with. In § 11 the use of re- 
presentation D‘}) of the rotation group brought with it the whole 
concept of spin angular momentum. So here, we obtain different 
theories describing particles with different properties according to 
which irreducible representation the 4, belong to. The representa- 
tions D'r) of the complete Lorentz group (§ 31) give scalar and 
pseudo-scalar “mesons”, the z-mesons for example being such 
pseudo-scalar particles. A theory associated with D4) is said 
to describe generally vector or pseudo-vector “mesons”. In particu- 
lar photons belong to the representation D‘t#) since this is the 
representation appropriate to the relativistic electromagnetic vector 
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potential, which is the field that gets quantized and actually satisfies 
(32.2): with m =0. The double-valued representations Dito+03:9) 
are used in theories to describe all the spin $ particles such as elec- 
trons and positrons, protons, neutrons, anti-nucleons and pz-mesons. 
As mentioned in §31, all physical consequences are independent 
of r in D(t0+04:r), so that we just write D‘to+0}), 

In this section we shall develop the theory appropriate to elec- 
trons. From § 11, we know that an electron has an intrinsic degree 
of freedom described by the spin functions w,, u_, which transform 
according to D‘!) under purely spatial rotations. Under the proper 
Lorentz group this becomes the representation D‘t® (see problem 
31.8). However, all electromagnetic forces are known to be invariant 
under space inversion, so that we should really consider the complete 
Lorentz group. The representation D‘® has to be enlarged then to 
D‘40+04) and we shall now proceed to derive the theory based on 
this representation. Instead of using the pure component form 
y, for the wave function (or field), we shall switch at this stage to the 
alternative form of writing ys (see § 11) in terms of some set of base 
spinors, 


b= Pu. = wu, + ow + Ped, + pyr_, (32.4) 


where the ¥, are functions of x, y, z, ¢ only, and where for con- 
venience we shall take the u, to be the functions u,, u_, V1, v_ 
transforming according to D(#+0t®) as defined by (31.23) to 
(31.26).t As discussed already in § 8, the spinors wu, cannot be 
regarded as functions of ~, y, z, ¢ because of their peculiar trans- 
formation properties, e.g. the double-valuedness of the representa- 
tion. It may be helpful to think of them as functions of some 
hidden degrees of freedom which will turn out to be the spin direction 
and whether the particle is an electron or a positron. 


The Dirae equation 


We return now to the wave equation (32.2). As it stands, it 
contains one unsatisfactory feature in that it involves the second 
differential coefficient 07/22. This implies that to determine 
~(r, t) completely, it is necessary to specify both ¥(r, ¢ = 0) and 
ai(r, ¢ = 0)/a as initial conditions, whereas physically just the 
state y of a system as one time should be sufficient to determine 


+ Where it is useful to employ the summation convention, we shall write 
the spinors u,, U_, 04, V_ 88 Uz, « = 1 to 4, always with @ or f etc. for the 
spinor suffices. Thus the I,,28 below transform under the complete Lorentz 
group according to D(#:®) as regards the u suffix and according to D@°+%) as 
regards « and £. 
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the state of all later times.; We therefore make (32.2) linear in 
6p/0t by factorizing it. If we write 


(YuPy + tme)(y,p, — me) = 0, (32.5) 
then this is equivalent to (32.2a) provided we have 


condition I: Ve? = 1, yyy = —WYp Sor pF v. (32.6) 


It is also necessary that the y, commute with the p,, i.e. they 
cannot depend on r, ¢. Nor can the y, depend on the g, since this 
would destroy the required linearity in the p,. We conclude that 
the y, operate only on the extra spin variables of the spinors u,, 
and that they just transform the w, into one another in order to 
preserve the form (32.4) of the wave function. J.e. we have 


condition II: the y, operate only on the u, and leave the vector space 
(w,, U_, V,, V_) variant. 


We now suppose that, because of the factorization (32.5), % satisfies 


This is the Dirac equation. Clearly every solution of (32.7) is also 
a solution of (32.5) and (32.2). However, the converse is not neces- 
sarily true for all mathematical solutions of (32.2), but since (32.7) 
has the required linearity in pr = —(h/c)0/@ we presume that 
(32.7) gives all the physically relevant solutions of (32.2). To keep 
the operator in (32.7) still relativistically invariant, we further require 


condition III: the y,, transform according to D+ under the com- 
plete Lorentz group €. 


Incidentally, it is interesting to note that in the case of the electro- 
magnetic field we can use (32.2) direct without linearizing, since the 
time derivatives are determined by the fields through Maxwell’s 
equations, e.g. 8B/d = —curl &. 


The y,, operators 


We now proceed to write down explicitly a set of operators, y,, 
verify that these do in fact satisfy the conditions I to ITI, and then 


+ Note that in quantum mechanics the state of a system is not just a specifica- 
tion of the positions of particles, but includes as complete a description of 
all other variables such as the momenta, etc., as the uncertainty principle 
allows. ; 
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prove that the conditions I to III define the y,, uniquely so that the 
operators we have written down are not some set of y, operators 
but the only set. Consider the operators y, defined by the relations 


(32.9a) 


In an abbreviated notation these matrices can be written 


. t04 fi. 
=| ss | Pr=|! aa (32.9b) 


where o; are the Pauli matrices and | the unit matrix 


=|; 7], w=[; |, 


l. 1: 
=|! al =|? a) (32.10) 


Since the operators (32.8) are defined in terms of their effect on 
U,, U_, V,, V_, condition IT is automatically satisfied. It is also easy 
to verify using the relation 


Tz0y == —Oysz = taz, ete. (32.11) 


that the matrices (32.9) and hence the operators (32.8) satisfy 
condition I. 

We now establish the transformation properties of the y,, under 
£, and hence verify condition III. Since the y, are not expressible 
in terms of x, y, z, T, one cannot transform them by a simple sub- 
stitution of variables as in § 3. However, one can still define in- 
directly what is meant by transforming such an operator. Suppose 
y~ and ¢ are two functions related by ¢ = yy, and denote trans- 
formed quantities by primes. Transforming this equation in the 
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sense of §§ 3 and 5 means obtaining from it ¢’ = y's’, and if we 
know the transformation properties of ¢ and y we can determine 
those of y. Expressed in a slightly different notation, if 7 is any 


transformation, 
Typ) = y' (TY). 
The operator satisfying this equation is 
y =TyT4, (32.12) 


which gives the transformation of an operator y (cf. equation 
(5.4f)). In our case if y is represented by a matrix I (32.8) and T 
@ proper or improper Lorentz transformation, then y’ is represented 
by the matrix 


(Ditoron (7) [Dito+0 (7) 7-1, 
We confine ourselves from now on to the base vectors u,, u_, 4, 
v_ and use the same symbol T' to denote the Lorentz transformation 
and the matrix representing it with respect to these base vectors. 
The transformation of the y, under J7 and 7 can be derived from 
(31.24) and (31.25). ® ; 
WT 7II- = Ir, IIT; f1-* = I, 
tig 7! = —TI7, tIy77) = I}. (32.13) 
Thus under JT, 7, ITr the y,, transform in the same way as 2, y, 2, 
T, ie. as a regular vector. Turning now to the proper Lorentz 
transformations, the matrices L representing them can be calculated 
from (31.26). Consider the rotation R(@, z) represented by the matrix 
exp(}19M) where M is given by (31.26). From (32.9) M = iI’yI’z so 
that from (32.6) M commutes with I, and Ip. Thus 
RI,R-1 = RR-T;, RI'7R- = Ip. 
Also from (32.6) M anticommutes with I’, and Iy, 
MI, = 11,00, = —il,T,T, = —I;M, My = —IyM. 
Also M? = EH. By expanding the exponentials we obtain 
Tz exp(—}10M) = exp(hiOM) Iz 
and etOM P, e-toM _. etioM tio [ 
= (E cos 6 + iM sin 60)I', 
= I, cos @ — ry sin 0. (32.14) 


Thus the J°, transform according to D‘+) under the spatial Lorentz 
transformation A(?, z). The same applies to the transformations 
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R(@, «) and Liv, z) which are represented by exp(}i6N) and 
exp($@K) (31.26). This follows immediately from the above 
analysis by noting that N = «iI,Ty, K =1I'yT; and by changing 
the appropriate suffices. Now any proper Lorentz transformation 
can be expressed as a product of R(@, z), R(@, x) and L(v, z). It 
therefore follows from all the above that the y, transform under the 
complete Lorentz group according to D+ in the same way as 
2, y, 2, T and the p,. This verifies condition ITT. 


Uniqueness of the y,, 


It remains to prove that the operators y, defined by (32.8), 
(32.9) are the only ones satisfying all of the conditions I, I and 
III. Let 7, be another set of operators satisfying the three conditions. 
From condition II we can write 


Vullg = P, 1,0BUes (32.15) 


where from condition I the matrices I, wane the relations (32.6). 
We now use condition III to prove rs = 47%, 

The first step is to show that the matrices ba are linearly indepen- 
dent, in fact that all the 16 matrices 


E, | ane | 9 ole ae ie PCL 7 (32.16) 
are linearly independent. Using (32.6) we note that each of these 
matrices except EZ can be written as a product of two anticom- 
muting matrices AB = —BA. Eig. 

Pr, = (Poly) ly = —P (Ply), 
| a Oy = (0,0, 0.00 r = Pol, I.' 7). 
The trace (sum of the diagonal elements) of any such matrix is 
zero, since 
A,pBg, = Trace(AB) = Trace(—BA) = —Bg,Aug. 

Let us denote the 16 matrices (32.16) by I",, r = 1 to 16. Then from 
(32.6) any product I, is another I,. In particular I’, = +2. 
Consider now the matrix 

16 

d= > asl" & 

1 

We have that 
Trace (IQ) = ay Trace (+2) 


which is non-zero unless ar = 0. Hence Q 0 unless all a, = 0, 
r =1 +016. I.e. the matrices I; are linearly independent. 
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The conclusion of the last paragraph does not depend on the 
number of vectors u, in (32.15) and applies to square matrices of 
any order n by n satisfying (32.6). The maximum number of linearly 
independent matrices of order n by n is n?, and since there are 16 
matrices I’, they must be at least of order 4 x 4. Thus there is no 
representation of the operators y, (32.6) of dimension less than 
four, and the matrices I, defined by (32.9) in terms of the four 
vectors u, = U,, U_, 04, v_ are irreducible. Equation (14.14) then 
shows that there is, apart from equivalence, only this one irreducible 
representation. Thus I”, and I, defined by (32.9), (32.15) are equiva- 
lent, and from (5.15) there exists a matrix P such that 


P= Pi,P3. (32.17) 
The last step of the proof is to determine P. Let us apply any 
proper or improper Lorentz transformation 7’. In the notation of 


(32.12) let 
I". — TI',T- a Drel', 


where from condition III D,s is determined by the representation 
Dt Similarly 


baa = TI',T-} = TPI,P-'T-}, 


and i", = DrsIy = PDrsl'yP-' = PTT ,T—P-}. 
Comparing these two equations, we obtain 
Ors =I';Q where Q = T-1P-1TP. (32.18) 


Since this is true for any I's, by Schur’s lemma (appendix D) we 
have Q=AH. Now det|Q| =1=A‘t, whence A=-+1 or +i. 
When 7' = £, then A =1 so that, by continuity, A= 1 for all 
proper Lorentz transformations ZL, and from (32.18) 

PL = LP. (32.19) 


The conditions of Schur’s lemma are not completely fulfilled by 
(32.19) since the u, and hence the matrices LZ are not irreducible 
under proper Lorentz transformations. Thus Schur’s lemma only 


gives 
P= | a = 
* Ne 


where P is now expressed in abbreviated form with respect to the 
base vectors U9, U_9 and U%»,, U_. In this representation [7 becomes 


, : | and from []-!P-\IP = AE we easily obtain, by trying 


372 GROUP THEORY IN QUANTUM MECHANICS 
the four possibilities A = +1 or +1, that y, = 7, 0r —7,. In terms of 
U,, U., ¥,, v_ these two possibilities become 


| ee ee ee | 


en eee a fey | 
P=m~{/ 641. Oe ee ae 


1 gk 
whence from (32.9) and (32.17) we obtain I’, = I, or —I, respec- 
tively. The minus sign in the latter case just corresponds to using 
the reversed order of factors in (32.5), and can be shown to lead to 
no new consequences of physical significance. 

Thus apart from the trivial alternative of sign, the y, operators 
are uniquely defined by the conditions I, IT and III. We are there 
fore completely justified in using the specific form (32.9) for the 
y, Without any loss of generality, although some authors prefer to 
derive all results using the defining properties I, II and ITI directly 
without ever writing, down the actual matrices (32.9) explicitly 
(Jauch and Rohrlich 1955). 


Plane wave solutions 


Since (32.1) applies to a free particle without any interactions, 
we expect the Dirac equation (32.7) to have plane wave solutions 


(ayu, + agu_ + av, + ay_)e retin (32.20) 


where the a, are constants giving the amplitudes of the four com- 
ponents #, (32.4). Substituting (32.8), (32.9), (32.20) into (32.7), 
we obtain 


(# — me*)a, + chkza, + ch(kz — tky)a, = 0 
(EB — mce*)a, + ch(kz + iky)ag — chkza, = 0 
—chha, — ch(ky — iky)a, — (EZ + mc*)ag = 0 
—chikz + iky)a, + chk, — (HE + me*)a, = 0. 


For these equations to be consistent the determinant of the co- 
efficients of the a, must vanish, which gives 


(#2 — mct — c*h2h?)2 = OQ 
i.e. EB = E, = (mct + c2h2k®)¥2 ws me? +- a hk? 
2z me* 
or EB = H_ = —(mi*ct + c®h2k?)/2 ss —me* — a hi2k* 


< —mc* 
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in accordance with (32.1). Corresponding to #, we obtain two linearly 
independent solutions for the a,, 


a a= _ ok — thy) 
aah mel Gm | OR met 
—chi(lez + iky) ___ Chik, 
a eed OG ee ee 
(32.214) 


and two solutions corresponding to H_ 
chk, chi(kz + tky) 


WTS tet SP le eet ET eee 
ch(ke — iky) —chk 
=F el °s = mal a, = 0 ag= 


(32.21b) 


There are therefore two types of solution. The one type has a 
positive energy greater than mc*, and the components of the wave 
function involving wu, and u_ are large while those involving v,, 
v_ are small of order p/2mc. The reverse situation applies to the 
other type of solution with negative energy. 


Motion in an electromagnetic field 


The properties of the particles to which the Dirac equation applies 
become much more apparent when we consider their interaction 
with an electromagnetic field. An arbitrary field can be described 
by the four-vector (Az, Ay, Az, Ap = id) where A is the magnetic 
vector potential and ¢ the scalar electric potential (Stratton 1941, 
p. 73). In a field we still have the relation (32.1) and hence (32.7) 
provided Hx and p are the purely kinetic energy and kinetic momen- 
tum mv. However the canonical momenta are given for an electron 
by 

Pp, (canonical) = p, (kinetic) — eA,/c 


where ¢ is the charge on a proton, and it is these canonical momenta 
that have to be replaced by the operators (32.3). Hence in (32.7) 
we put 


22g e 
Py = —h op + 5 Au (32.22) 


The equation still retains all its invariance properties under the 
proper Lorentz group. However, it is no longer exactly equivalent 
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to (32.2) because the p, now do not commute, but we shall assume 
that the correct description of our particles in an electromagnetic 
field is obtained from (32.7) rather than (32.2). This is justified 
firstly by the fact that it correctly describes observed physical 
phenomena, and secondly because the corresponding extra terms in 
(32.2) are just the ones required to keep it Lorentz invariant and 
have no classical analogue. 
If we put 


b= (yy + gu + pgv, + pyv_je ee" 


we can from (32.9b) write (32.7) as two equations for two functions 
yy and ys, each having two components (,, #,) and (if, #4) respectively. 


coy{ it + = Ay) be + (EB + ed — me*)f, = 0 (32.238) 


coy( —it + = As) + (EH + ed + mo*)p, = 0 (32.23b) 
This shows that y; is large and #, is small for positive energy states 
and vice versa for negative energy, in accordance with (32.21). 
We can separate the large and small components more clearly. 
Suppose E is positive and equal to H’ + mc?: multiply (32.23a) 
by (E +e¢ — mc?), (32.23b) by co,(—thO/d; + eAj/c), subtract 
and divide by —2mc?, Then we obtain 


ih 


h? ih e . e2A2 
(- anY ep ——A : grad — — divA +a) + 


2mce 
EH’ + ;j 
( 5 = be = E'th 


e ae 
+ 8 Hh + 778° Obs — 


(32.24) 


Here s = }/c is the spin angular momentum operator used in 
§11 (see particularly problem 11.1), and @, H are the electric and 
magnetic fields. (32.24) shows that ¥, almost satisfies an orthodox 
Schrédinger equation. The terms in brackets describe the inter- 
action between the charged particle and the field in strict analogy 
with classical mechanics (Schiff 1955, pp. 138, 292). In particular, 
for a uniform magnetic field, 
—eih € 
= A: grad = Dane H (32.25) 
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where 1 is the angular momentum (8.27). The term (e/mc)s - H 
gives an additional interaction with an intrinsic magnetic moment 
(—e/me)s. This is proportional to the spin angular momentum 
but twice as large as one would expect by analogy with the orbital 
angular momentum term (32.25). The next term gives the spin- 
orbit coupling as can be seen after some further manipulation. 
From (32.23b) neglecting A and small quantities of order (e¢ 
+ E’)/mc*, we obtain 


2me 
Putting also 
ldd¢d 1 
$=¢\r|, @=—c7 rr sS=_Fhe, 
we have from (32.11) 
(s‘&) (o> ~—ih grad) =i& A (—ih grad) -s= Eis, 
te e ld¢ 
ye 8-6 pb. = — ota as I-s yy, (32.26) 


which is the usual spin-orbit coupling term for an electron in a 
central field (11.21). The last term on the left of (32.24) is another 
small relativistic correction which shifts all the energy levels slightly 
but introduces no new effects. Thus (32.24) has the following three 
features. 


(i) The large components y¥; = (,, #.) occur in the wave function 
multiplied by the spinors u,, u.. Those transform according to 
D/2) under pure rotations as shown in the previous section. This 
point is most important because it means that 

(ii) we can identify the operators 8, = }hoz, etc., with the com- 
ponents of the intrinsic (spin) angular momentum operator. We 
recapitulate briefly the argument of § 11 (see particularly problem 
11.1). In quantum mechanics the angular momentum in the 
absence of a classical analogue is defined through the infinitesimal 
rotation operators (appendix F). The infinitesimal rotations 
operating on u,, w_ are represented by the matrices }oz, $oy, $oz. 
Hence, if we write the wave function in component form (,, #4), 
the spin angular momentum operators are 8; = thoz, ete. 

(iii) On neglecting the last term on the left of (32.24) and the 
approximations inherent in (32.26), #, satisfies an orthodox two 
component Schrédinger equation. The Hamiltonian besides the 


25 
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usual classical terms includes the interaction of an intrinsic magnetic 
moment (—e/mc)s parallel to the spin of the particle, and also a 
spin-orbit coupling term (32.26). 


This is precisely the description of an electron used in § 11 which is 
known to give agreement with experiment (apart from relativistic 
and radiative corrections). We therefore conclude that particles 
described by the Dirac equation really are electrons as expected. 


Positrons and charge conjugation 


In a field theoretical discussion, we find that a theory based on 
the Dirac equation not only describes states with any number of 
electrons but also states containing two types of particles, electrons 
and positrons, which have identical properties except for opposite 
charge. This property is connected with the fact that the field 
equation is invariant under an additional symmetry transforma- 
tion called charge conjugation. When this operates on a state 
vector (wave function) describing the system, it turns it into a 
new state in which electrons are replaced by positrons and vice 
versa. All states have positive energy relative to the vacuum state 
(defined as the state with no particles). 

Now our present one-electron wave equation type of approach 
is really rather inadequate for discussing these features of positrons 
and charge conjugation, but we can exhibit something of their 
properties by considering the negative energy solutions of the 
Dirac equation and going through certain mental gymnastics. 
To start with we note that there are an unlimited number of negative 
energy states (32.21b) with energy going to —oo, so that at first 
sight an electron initially in any state would continuously radiate 
energy as it jumps to states of lower and lower negative energy 
without limit. However, electrons do not behave in this fashion 
experimentally, and we must suppose that all the negative energy 
states are ordinarily filled and that electrons satisfy the exclusion 
principle (obey Fermi statistics) which prevents them from jumping 
into the filled negative energy states.t The infinite charge density 
corresponding to all these filled states is somehow supposed to be 
unobservable, but if a negative energy state happens to be tempor- 
arily empty this will be observed as a net positive charge, namely | 
as a positron. The energy is also positive, corresponding to the 
energy required to lift the electron out of the negative energy state. 

t Incidentally this explains in part why particles with a spin of half an odd 


integer have to obey Fermi Dirac statistics, while particles of integer spin 
obey Bose-Einstein statistics (Pauli 1955). 
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To derive the behaviour of a positron, we have to study the 
dynamical properties of a negative energy state. Equation (32.23) 
shows that #, is now large and 7; small in accordance with our 
experience with the free particle solutions (32.21b). If we put 
E = —mc*? — E” we can separate the large and small components 
as before, and obtain an equation (A) which we shall not write 
down but which is similar to (32.24). The similarity in fact becomes 
striking if we take the complex conjugate of (A) and put ¢; = (w,*, 
—s*), bs = (—da*, x*). Then (A) becomes 


—h? eth eh _. e2A2 
( SNE ep A Srad ho ae Ae )m 


2 Tinet 
: RE’ — 
~-£6.Hg —~0- 6$,— 4, = 28 


(32.27) 


This is identical with (32.24) except that the sign of the charge 
has been reversed! The orbital motion of an electron is described 
by the orbital functions 4, or ¢;, $s, and hence (32.27) shows that 
an electron in a negative energy state moves in an electromagnetic 
fields as if it had a positive charge but otherwise the ordinary mass 
and spin of an electron. Returning to our picture with all negative 
energy states except yf filled, all the electrons in the filled state, 
move as if they had a positive charge, so that the hole corresponding 
to the unfilled state also moves as if it had a positive charge. In 
short, a positron behaves in every way as an electron with a positive 
charge e. 

The relationship between electrons and positrons can also be 
demonstrated using the charge conjugation operator I where + 


Ty, = $a" 
Pug = Cygu. (32.28) 
. —l 
where C= _ 
1 
This transformation turns a negative energy state ¥ into a positive 


energy one as is seen in two ways. Firstly from (32.20), taking the 
complex conjugate changes the sign of the energy, and secondly 


¢ The operator I should not be confused with the matrices I, of (32.9). 
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the operator C turns a wave function with large wu, u_ components 
into one with large v,, v_ ones. Now the probability density 


a (r, t) P(r, t) = ('p,)*(Ty.) (32.29) 
(« summed) 


is the same for both ¢ and I'y, so that they have the same dynamical 
properties which we shall determine by deriving the equation 
satisfied by the positive energy wave function I. The negative 
energy state y satisfies the Dirac equation (32.7) for an electron, 
which in component form becomes 


(Diag? e Me) = 0. 
Taking the complex conjugate we obtaint 
(Pr .ePu* + ime)p,*ug* = (32.30) 


It follows from problem 31.4 that the spinors u,*, u_*, v,*, v_* 
transform under all Lorentz transformations like v_, —v,, —w_, 
u,, i.e. u,* transforms like Cu,. We must regard u,* as still a func- 
tion of the same intrinsic variables as u, and hence as a function 
in the vector space (u,, uw_, v,, v_). Then by Schur’s lemma w,* 
is proportional to Cu, (problem D.2). It can also be verified from 
(32.3), (32.9), (32.28) that It.ep,* = —(CI,C—),gp,. Hence 
from (32.30) we obtain 


(CLC) app, — imc}p,*Cug = 0. 


Here by definition (32.14), the operator represented by the matrix 
CIr,C—' with respect to the vectors Cug is just y,. We therefore have 


ee ae 
|».(—in =) ~ime| rp = 0. (32.31) 
Thus Ty satisfies the same equation as y does. However, if we 


include the interaction with an electromagnetic field, (32.7) and 
(32.31) become respectively 


7) 
[ro(—it 5) — ime + 2y,4,] p= 0, (92.2) 


 @ : 
».(—s 5) — ime — S| Ty =0 (32.33a) 


t Since the u, transform with complex coefficient matrices under real rota 
tions and Lorentz transformations, they must be regarded as complex 
quantities. 
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A comparison of these equations now shows that Is describes the 
motion of a particle which behaves dynamically in exactly the same 
way as an electron with positive energy but having opposite charge, 
and hence from (32.29) so does the negative energy electron state . 
As before, we suppose ys is unoccupied whereas al] the other negative 
energy states are filled with electrons, which is observed as a net 
positive charge. We have now shown therefore that this system 
can be described from the point of view of the total energy, the 
net charge and the dynamical properties by saying that we have a 
positron with positive energy. 

We can now see vaguely how it is possible in field theory to obtain 
the more satisfactory and tidy kind of description of positrons 
already outlined at the beginning of this subsection. We first note 
that by using the pair of equations (32.32) and (32.33) instead of 
just (32.32) alone, we can discard all negative energy solutions as 
physically meaningless, because the negative energy solutions of 
each equation have been redescribed as the positive energy solutions 
of the other. Furthermore if we regard charge conjugation as also 
changing the sign of the electromagnetic field or the charge 


IA = —A or Ie = —e but not both, 


then (32.32) and (32.33) become formally the same equation and 
we have I’ as a symmetry property of the equation. Since I'(I'y) 
= y, the theory is then completely symmetrical between electrons 
and positrons. 


Transformation properties of physical quantities 
All quantities of physical interest such as matrix elements or the 
charge density, involve expressions of the formt 


Prop = 1," Anethe- (32.34) 


Here 6 is some operator and 4 the matrix representing it with 
respect to the u,. In general 5 may also involve multiplication by 
a function of » or differentiation, but this generalization can easily 
be made at the end. 

We shall now discuss the transformation properties of (32.34) 
and start by examining what precisely is meant by transforming it. 
Let P be the point z, y, z, T in our system of co-ordinates and x’, 
y’, 2, T’ for another observer whom we shall refer to as “him’’. 

¢ In writing *, we are here interpreting u* in the sense of (19.9), and not 
in the sense of (19.10) as was done when discussing charge conjugation. 
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Let L denote a proper Lorentz transformation and also the corre- 
sponding matrix. Then from the fundamental definition of trans- 


forming a function, Lu, = L,,u, means 
(our Ug) = Lig (his Uz) 
and (our P,) = Lag? (his yp). 


Hence we have 


(our #,*) yg (our Ys) = (his ¥,*)(Z*-14 1), 6 (his ye). 
(32.35) 


Now [*-14-1 does not appear to be equal to anything special, 
certainly not to the transformed matrix DAL~1 (32.12) because 
Lorentz transformations are in general not unitary. However, we 
can simplify (32.35) if we write 5 in the form 


S=yryr - (32.36) 


where yr is any combination of the y operators, and use the fact 
that yr happens to have the special property 


LD y7 = yok (32.37) 

to be verified presently. Then we have 
L*-14L-) = Pp LI, L-. 
Suppose now that y, transform according to some representation D; 
(his ys) = L(our ys) L-! = Dyes (our yr). 
Using the relation 
(his yr) (his up) = Ir,.¢ (his u_); 

we finally obtain from (32.35) 


(our $*) (our. yryr) (our #) 
= Drs (his #,*) ('7rl'r)ag (his 4g) 
= Dy, (his ¥*) (his yryr) (his ¥), 


p*yrynp transforms under proper Lorentz (32.38) 
transformations in the same way as yr. 


For example the charge density p = —e/*y is not invariant under 
Lorentz transformations as we might naively expect. (Lemma 


that is, 
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C.2, appendix C, is inapplicable because a Lorentz transformation 
is not unitary.) However, if we write p as 


p= —eb*h = —eb*yryrp, 


we see that it transforms like the fourth component of a four- 
vector, as a charge density in relativity should (Stratton 1941, 
p. 70). However the total charge 


—e [ ytpdo = —e | pt yrynp de dy de 


is invariant; because from (32.38) this transforms like y7 dx dy dz, 
which transforms in the same way as dx dy dz d7’, which is invariant 
as can easily be shown in the manner of problem 32.6. Incidentally 
because of the importance of property (32.38), *y7 is abbreviated 
by many authors to ¢. 

It now remains to verify (32.37). In the notation of (31.26), 
consider L = R(8, z) = exp(}i0M), where M = ily, = Mt = M*. 
Hence [* = exp(—}i0M) = L-1, and [7M = MIp. Hence 


ir el =f 'P. TL = f3LT = Fi T. 


Thus (32.37) is satisfied for R(@, z), and similarly for R(@, x). 
Consider now L(v, z) = exp(}i0K) (31.26), where K =iI7IT, = R 
= K*, KI'7 = —IpK, and 16 is real. We have 


L*rph = Ll ph = LE 7 = Ty, 


and (32.37) is again satisfied. For a product L = L,L, of two such 
Lorentz transformations we have 


(£,L,)*T" 7(L,L,) = LAL, *T. rll, =r. 


Since any proper Lorentz transformation can be written in the form 
R(O,, z)R(O,, x)Liv, z), it follows that (32.37) is satisfied for all 
proper Lorentz transformations. 


Summary 


We have sought relativistic wave functions transforming accord- 
ing to D(t°+°) under the complete Lorentz group, and have arrived 
at the Dirac equation. This equation is invariant under the com- 
plete Lorentz group (with the proviso of problem 32.9). The in- 
variance property is sufficient to define the y, operators without 
arbitrariness. The particles described by the Dirac equation have 
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an intrinsic spin angular momentum and a magnetic moment. 
The equation exhibits two types of solution which are related by 
the charge conjugation transformation and which correspond to 
electrons and positrons; more precisely the absence of an electron 
from a negative energy state behaves in every way like a positively 
charged particle which we identify as the positron. The quantities 
like Y*yryrb occurring in physical expectation values transform 
in the same way as the yr. 


References 

The relativistic invariance of the Dirac equation is shown nicely 
by Van der Waerden (1932). A comprehensive field theoretical 
treatment is contained in Jauch and Rohrlich (1955). 


PROBLEMS 


32.1 Derive equations (32.33) in detail. 

32.2 Calculate the matrix ZL representing L(v, z) in the re- 
presentation D(+°+%), Is D a unitary matrix? Why does lemma 2 
of appendix C not apply? In the manner of problem A.9, prove 
LL = E and det|L| = 1. (Dirac 1947, p. 259.) 

32.3 Using (32.22) find the extra terms required to make (32.2a) 
equivalent to (32.7) and Lorentz invariant. 

32.4 A quantum mechanical operator such as the Hamiltonian 
which corresponds to a physically observable quantity, in this case 
the energy, is Hermitian. This means that if it is represented by a 
matrix A, then A* = A in the notation of appendix A (see Schiff 
1955, p. 129). Show from (32.9) that the y, are Hermitian. Show 
also that yryz is Hermitian, and note the relevance of this to 
problem 32.5. 

32.5 Derive the electric current density in the usual way (Schiff 
1955, p. 23) from (32.7) and show that it has components 


Jz = —tec(b*yryzib), etc. 


Note that the fourth component of this vector is correctly related 
to the charge density p by 


p =Ixz/ic = —ef*yryrp = —epryp. 
32.6 If LyeL-1 = D,,y, where L is a proper Lorentz trans- 


formation, and if y, = yzyyyzyr, show that Ly,L[-! = det|D,,|y, 
= y,; i.e. y; remains invariant under proper Lorentz transformations. 
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Hence show that the quantities 1, y,, Yu» = —# Yury — Wu) 
Ys’ x transform respectively as a scalar, vector, skew-symmetric 
second rank tensor, pseudo-vector and pseudo-scalar under proper 
Lorentz transformations and space-inversion. 

32.7. Show, using problem 32.6, that the transformation pro- 
perties of 


php y ry gyi (j = 2%, Y;, z) 


under proper and improper Lorentz transformations are just those 
of an angular momentum vector. This quantity is in fact the spin 
angular momentum density. Verify this for the plane wave states 
(32.20), and prove it generally from the definition of angular momen- 
tum (appendix F). 

32.8 Show that the current density and spin density of a posi- 
tively charged particle in the state Ij, is exactly the same as that 
given by the absence of an electron from the negative energy state y. 

32.9% Prove that the electromagnetic vector potential A, 
transforms as a pseudo-vector of type 1 (Table 36, § 31), whereas 
8/4 transforms as an ordinary vector. Hence, show that the Dirac 
equation with an electromagnetic field is still invariant under space- 
inversion JZ, but no longer under the simple time-inversion (r, 
t—>r, —t) of equation (32.13). However, following Jauch and 
Rohrlich (1955, p. 88), show that there exists a more complicated 
transformation called time-reversal which leaves the Dirac equation 
invariant, and establish its analogy to the non-relativistic trans- 
formation of § 19. 

32.10 Derive the irreducible representations of the 16 operators 
yr corresponding to the matrices (32.16). Hint: show that the 32 
operators yr, —yr form a group: derive its irreducible representations 
D, and strike out those not satisfying D(—yr) = —D(yr). 

32.11* Let yr be the 16 operators corresponding to the matrices 
(32.16). Write down their multiplication table and show that they 
do not form a group because of the occurrence of minus signs. They 
form what is called a ring. Discuss to what extent the theory of 
representations developed in § 6 and appendices C and D applies 
specifically to groups and to what extent also to rings. In this way 
justify our use of (14.14) and Schur’s lemma in proving the unique- 
ness of the y, in the text. References: Van der Waerden 1932, 
p. 54; Jauch and Rohrlich 1955, appendix A; Boerner 1955, Chapters 
I and IT. 

32.12 Discuss the invariance of Maxwell’s equations under 
rotations, space-inversion and Lorentz transformations. Consider 
only the transformation of the components &z, &y, &2, etc., among 
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one another and not their dependence on r, t. (See problem 11.9 
and Stratton 1941, p. 70.) 

32.13 In the field theoretical formalism, the Dirac equation is 
derived as a variational equation from the Lagrangian 


a. me 1e 
B= — Wye get eb — erred. 


Verify that £ is invariant under proper Lorentz transformations, 
space-inversion, and charge conjugation, but not under the simple 
time-inversion 7 (problem 32.9). Note: in the case of charge con- 
jugation, use (0),*/du)%$_ + Y,(O%,*/A2) =O (see Schiff 1955, 
equation (47.8)), and p*yry,,(06/.) + (Op*/On)yry,. = 0 (derivable 
from the Dirac equation). | 


33. Beta Decay 


The electron-nuclear interaction 


Let us consider a nucleus, and suppose that the energy levels and 
eigenstates have been determined in principle from the forces 
between the nucleons as in Chapter VII. Now in practice it is observed 
that nuclei do not stay in such states indefinitely, as we would 
expect if they were true eigenstates for the system. For instance 
it may happen that the 1s atomic electron, which has a finite pro- 
bability of being at the nucleus, gets absorbed (like a photon) while 
the nucleus makes a transition to a new state (Bethe and Morrison 
1956, p. 230). Thus there must be some interaction term in our 
equations which couples the nuclear field with the electron field. 
In the previous section, it was shown how one can only obtain an 
adequate description of states containing several particles or a 
varying number of particles by using a field theory. These remarks 
apply particularly to the process considered above: we start off 
with a state containing Z electrons which then changes continuously 
into a state containing Z — 1 electrons. However, we shall not go 
into the intricacies of such a field theoretical calculation. For 
present purposes it is sufficient to note that the interaction between . 
the electron and nucleon fields can be represented in our formalism 
by an interaction term # jn; in the nuclear Hamiltonian, and that 
this interaction term gives rise in the usual manner of second order 
perturbation theory to a transition probability per unit time of 
(Schiff 1955, p. 199) 


2 2 
= p(EZ) | | ¢tina% infiniti dv] (33.1) 
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Here the #’s are the initial and final wave functions of the whole 
nuclear-electron system and p(H) the density of final states per unit 
energy. 

In this section we shall not consider the electron absorption 
process mentioned above, but only the usual beta decay process in 
which an electron is emitted by the nucleus. We therefore write 
ttinal = Prbe where ¥ is the final wave function for the nucleus 
and pe a plane-wave wave function for the electron going off in some 
direction. It is known experimentally that to conserve spin, energy 
and momentum, the process must involve another particle, the 
neutrino, with zero mass, zero charge and spin 4 which is difficult 
to observe directly (Bethe and Morrison 1956, p. 25). One can 
either consider a neutrino to be emitted in the process or an anti- 
neutrino to be absorbed. We shall adopt the latter description, 


neutron -++ antineutrino — proton + electron, (33.2) 


and therefore write the initial wave function as Yy,, where Y is 
the initial nuclear wave function and yf, is a plane-wave antineutrino 
wave function. 

Now the form of # nt is not really known. For the sake of simpli- 
city it is usually assumed that it does not depend on the total charge 
(like an electromagnetic force), or on the momentum and angular 
momentum of the nucleons (like a spin-orbit coupling). If, further, 
HA int is to be relativistically invariant, it can be shown that Hint 
must have the following form (problem 33.2). 


(33.3) 


il Wet be*# rnp, Vs dv = f Wr" [S (Pn) ton) Pi dv 


Hr) = yr(Cspe*y rp, + C'spe*y ry sy) + 
+ yryu(Cvpe*y ryt, + C’vbe*yryy sty) + 
+ bye (Crpbe*yrynthy + CO the*yryuy st) + 
+ yryuys(Cape*yryuyst, + C’ape*y ryt) + 


+ yrys(Cepe*y ry sh, + C'ppe*yry,). 
(A, » summed) 


Here the y, are the operators (32.8), also ys = yzyyv2YTs Yau = 
— ily, — ¥,,). The operators outside the brackets operate 
on the ‘8 nucleon wave function, and the Cs, etc., are arbitrary 
constants. The operators inside the brackets operate on the electron 
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and neutrino wave functions, i.e. in component form with the 
summation convention 


Le +p, — ad (r) I, wa’y,a(1) ° 


Also in (33.3) 7,” is the isotopic spin operator of chapter VII. 
If the ® nucleon in the initial state is already a proton, it cannot 
beta decay and 7,, gives zero. If it is a neutron, it can decay into a 
proton, and the 7,, turns ¥; into a wave function with the same 
total charge and Mr as ¥;. If we wanted to describe positron 
emission and electron capture we would have to include in (33.3) 
also yr Cs*b,*yrber_ Se 


Relativistic invariance 


The relativistic invariance of the interaction (33.3) follows imme- 
diately from the work of the preceding section. Equation (32.38) 
shows that ¢,*yry. transforms in the same way as yr under 
Lorentz transformations, where yr is any combination of the y, 
operators, e.g. y,,7;. Thus, considering in (33.3) the term in Cy, 
ge*yry,, transforms like a four-vector and ¥;*yry,7,¥% like- 
wise transforms like a four-vector, so that the summation over p 
gives us the invariant scalar product. Similarly, all the other terms 
in (33.3) are invariant under proper Lorentz transformations, 
each being the scalar product of two factors which transform in 
the same way. The suffices S, V, T, A, P stand for scalar, vector, 
tensor, axial vector (pseudo-vector), and pseudo-scalar, respectively, 
because of the transformation properties of the respective factors 
(problem 32.6). However, from (32.13), y; changes sign under 
space-inversion since yz, yy, yz each do but y7 does not, so that the 
two terms in each bracket of (33.3) have opposite parities, and the 
whole interaction is not invariant under space-inversion. The 
consequences of this are discussed below. 


Allowed transitions: selection rules 


As a first approximation the size of a nucleus is small compared 
with the wavelengths of the electron and neutrino, so that we may 
put ¥-(r) = const = ¥-(0), ¥,(r) = ¥,(0) over the nucleus in (33.4). 
This gives the probability for an allowed transition. If it is zero, 
the actual transition probability may still be non-zero though 
small through the variation of #_(r) over the nucleus, such transitions 
being termed forbidden. We shall return to these later and now 
discuss the nuclear part of (33.3) for an allowed transition. Consider 
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the axial vector term. We have for the operators operating on the 
nuclear spin functions 


C. 


. 0 ; 01 
varie =i| an = 2, Y, 2), YTYTYs = IS il (33.5) 


in the abbreviated notation of (32.9b). Now protons and neutrons 
are spin } particles, and from the previous section we would expect 
the one-particle wave functions to satisfy a Dirac equation. While 
this cannot be precisely right because the particles do not have the 
magnetic moment deduced from the equation, we shall assume 
that for our purposes it is sufficient to assume that the Dirac equation 
applies. Then the first two components of the wave function are 
large and the last two smaller by a factor v(nucleon)/c. The opera- 
tors yryjy,5 connect large with large components, and if they give 
& non-zero matrix element we have an ordinary transition. On the 
other hand yryry, connects large with small components, and if 
it is the only term giving a non-zero probability, we have a relativistic 
transition with probability reduced by about (v/c)* ~ 10-8. Similarly 
all transitions can be classified as ordinary or relativistic. 

Let us now return to the operators yryjy, (33.5). For calculating 
the transition probability we may neglect the small components 
of the nuclear wave functions completely and write the operators 
simply as toy = (2t/h)s;. They transform according to D™ under 
rotations, whence in the manner of (13.12) we have the selection rule 
for the nuclear angular momentum quantum number: 


J +J,J +1 except 0 +0, w; = wr. (33.6) 


The selection rule for the parity w follows from the fact that yryjy, 
is invariant under the space inversion IJ (yr is invariant, y; and y, 
change sign). Similarly, the tensor interaction of (33.4) leads to 
ordinary transitions with the same selection rule (33.6), which is 
called the Gamow-Teller rule. However the operator for the scalar 
interaction has the form 

1 0 


which is invariant under rotations, and hence has the Fermi selection 


rule 


The vector interaction has the same selection rule. The selection 
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rules for all the interactions in allowed transitions are summarized 
in Table 38, the rules for relativistic transitions being 


J >J,J +1 except 0 > 0, wi = —w, (33.8) 
or J J, wy == —wr. (33.9) 


Note the difference between the parity rules for ordinary and 
relativistic transitions. 


TABLE 38 
Selection Rules for Allowed Transitions 


Selection rules 

Interaction Ordinary Relativistic 

transition transition 
Scalar equ. (33-7) none 
Vector (33.7) equ. (33°8) 
Tensor (33.6) (33.8) 
Axial vector (33.6) (33-9) 
Pseudoscalar none (33.9) 


Favoured and unfavoured transitions 


So far we have classified transitions as allowed or “forbidden’’, 
and as ordinary or relativistic. We shall now subdivide each cate- 
gory further into favoured and unfavoured. Consider the allowed 
ordinary transitions, and let us suppose the ¥;, Y; are expressed 
in terms of the shell model as linear combinations of determinants 
of one-nucleon wave functions. Now 7+, operating on such a 
determinant gives identically zero if the n*® nucleon belongs to a 
doubly-filled shell, i.e. a shell filled for both protons and neutrons, 
because there is no vacant proton state to go into. Thus doubly- 
filled shells do not contribute to beta decay, and for this reason 
transition probabilities do not increase rapidly with atomic number. 

Using determinant wave functions, we may further reduce 
(33.3) to a sum of simple proton-neutron matrix elements 


f peter yyhy do. (33.10) 


If we neglect the small components of the nuclear wave functions, 
# becomes a linear combination of 1, oz, oy and o, (problem 11.1). 
Thus # changes at most the spin of the particle, and (33.10) is 
zero unless yp and yw have the same n and / quantum numbers. 
For instance a 2p,,. state can decay into a 2p,/, or 2p3/, state but 
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not into 2s,,.. Thus % and ¥; must belong to the same set of 
n, | quantum numbers for a transition to be favoured. Otherwise 
(33.3) is non-zero only because of configurational mixing, which 
reduces the transition probability by a factor of about 0.02 to 0.05 
(Blatt and Weisskopf 1952, p. 721). An example of a favoured 
transition is 

Hes (J = 0) > Li6 (J = 1) 


the configuration in both nuclei being (18,/.)4(2p5/.)?. A useful 
measure of the size of the nuclear matrix element squared is the 
quantity 1/(ft) where ¢ is the half-life and f a correction factor that 
takes into account the energy of the electron-neutrino pair. Favoured 
transitions have logioft ~ 3-1 + 0-3 (Table 39). Now most nuclei 


TABLE 39 
Some Beta Decays 


Initial Final 
nucleus J; nucleus J, logy ft 
neutron $ Hi} 4 3-2 
H? $ He’ $ 3-1 
He*® 0 Lié 1 2-8 
Cio 0 Biox* 0 3-8 
(positron 
emission) 
about 25% of all known decays 5-0 + 0-5 
Be? 0 Bro 3 13-7 


contain rather more neutrons than protons, so that when one neutron 
decays into a proton, the latter for energetic reasons would be 
expected to lie in a lower subshell, ie. Yj and ¥; belong to different 
configurations. We may conclude therefore that, except in light 
nuclei, most transitions will be unfavoured. Indeed the most 
common value of logiofé is around 5, corresponding to allowed 
ordinary but unfavoured transitions. 

In the case of the scalar and vector interactions, # reduces to a 
constant for allowed ordinary transitions, and the conditions for a 
favoured transition become even more stringent. (33.3) becomes 


[ Pe" DreMile, 7, Mer) do = f PT, Pia, T, Mn) dv 
=(T(P +1) — Mo(Mo + 1)P?/%*Pi(o, T, My + 1) do, 


where @ stands for all other quantum numbers required to designate 
the initial state completely. This matrix element is zero, therefore 
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unless ¥; belongs to the same 7'-multiplet as ¥Y; (§ 29). This situa- 
tion is rare for the following reason. Neglecting the Coulomb forces 
and the proton-neutron mass difference, the initial and final states 
have the same energies if the nuclear forces are charge-independent. 
But when we include the effect of the Coulomb repulsion and the 
mass difference, the initial nucleus with lower Z has the lower 
energy, so that we would not ordinarily expect beta decay to be 
energetically possible. An exception is H* > He*® in a T = }, 
J =} T-multiplet (Table 39). Instead the Coulomb forces favour 
positron emission, and an example being the decay C1 > B1o** 
(second excited state) in the 7-multiplet 7 —1, J = 0 shown in 
Fig. 44. Incidentally this J = 0 >J = 0 transition shows that the 
Hamiltonian (33.4) must contain some scalar or vector term from 
(33.7), whereas the 4J = 1 transition He* > Li® (Table 39) show 
that it must also contain a tensor or axial-vector term. 

The present subdivision into favoured and unfavoured transitions 
can also be applied to relativistic and forbidden transitions, though 
in the latter case it becomes much more complicated and loses its 
usefulness. 


Parity non-conservation 


So far in the selection rules (33.6) to (33.9) we have only con- 
sidered parity as far as it affects the nuclear matrix element. We 
shall now consider the effect of the space inversion [7 on the whole 
interaction (33.4). Since the terms with unprimed coefficients 
contain each y operator twice, once operating on the nuclear com- 
ponents and once on the electron-neutrino ones, they all remain 
invariant under J7. The terms with primed coefficients contain an 
extra factor y,;, 80 that they change sign. Thus the transition proh- 
ability would be unaffected by IT if either all primed or all unprimed 
coefficients were zero, because the probability depends on the square 
of the matrix element. However, the transition probability is not 
invariant under JT if (33.4) contains both non-zero primed and 
unprimed terms (Lee and Yang 1956). We shall now illustrate 
this asymmetry by a particular example. 

A nucleus undergoes a decay with spin change J >J — 1. 
By the use of magnetic fields and low temperatures, the nuclei 
have all been oriented with their magnetic moments along the 
z-axis, i.e. they are in the My = J state, so that the transition must 
involve the change My =J > My =J—1. From the selection 
rules (33.6), (33.7), only the tensor and axial vector interactions 
can take part in this transition, and for the sake of simplicity we 
shall assume all coefficients in (33.4) to be zero except Cy, C'a. 
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The axial vector interaction can be rewritten in terms of y = yz 
+ ty, instead of yz, yy, and because of the change in My the only 
terms giving non-zero contributions to the matrix element (33.3) are 


yry_ys(Cape*y ry sys, + OC ape*yrys7¥,)- (33.11) 


We shall only calculate some probabilities for the cases when both 
electron and neutrino are travelling parallel to one another in the 
positive or negative z-direction (upper and lower signs below). 
We only consider allowed transitions, and from (32.21) the com- 
ponents of ¥-(r), ¥,(r) for electron spin parallel and neutrino spin 
antiparallel to the z-axis are 


%e(0): (1,0, FB, 0) (0): (0, 1, 0, 1) (33.12) 


where B = cpe/(He + me’). 

The neutrino function follows from (32.21) by putting m = 0 
and noting that, in our formalism, ¥, denotes an antineutrino wave 
function (negative energy solution; see (33.2)). Substituting (33.12) 
into (33.11) we obtain for the bracketed quantity in (33.11) the 
values 


—104(1 + B)+iC’'4(1 +B), = (ttt), (33.13a) 
iCal + B)+iC’a(1 + B), (454). (33.13b) 


The set of arrows denote the directions of the electron and neutrino 
momenta and spins with respect to the z-axis as follows (Pe, Se; 
p,, 8,). Since the nucleus loses one unit of spin angular momentum 
in the process, (33.2) shows that all other arrangements of electron 
and neutrino spin give zero probability: in particular we have the 
value 


zero for (tt;/t). (33.130) 


In all these transitions the value of the nuclear matrix element is 
the same. Thus from (33.13) we obtain transition probabilities 
proportional to 


(1 + B)*%{[Ca|? + [C’a]? — 2|C4||C’al), (44:4), 

(1 + B)%(|Cal? + [C’al + 2{CallC’al), (tity). 
Le. if neither C4 nor C’, is zero, parity non-conversion in the inter- 
action leads to different transition probabilities for the electron 
and neutrino travelling together along the positive and negative 
z-axes. Note that in (33.13) we are comparing states in which the 
spin directions are the same, because spin is invariant under JJ 
(appendix F). However, even with a parity conserving interaction 
we would not expect equality between (33.13a) and (33.13c), for 

26 
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here we have reversed all momentum and spin directions. This 
corresponds to a 180° rotation about the x-axis, and since we are 
observing the decay of an oriented nucleus, the whole system does 
not have this rotational symmetry. Experimentally one usually 
measures only the rate at which electrons are emitted without 
measuring their spin, and this too is not equal in the positive and 
negative z-directions, as can easily be shown by integrating the 
transition probability over all orientations for the neutrino and 
summing over all spin combinations. We have therefore shown by 
direct calculation that the non-conservation of parity in the inter- 
action (33.4) leads to an asymmetry in the observed angular dis- 
tribution of the electron, an asymmetry that is absent for a parity 
conserving interaction. 


Tests of parity conservation 


In practice, it has only recently been discovered that parity is 
not conserved in the interaction, because previously no experi- 
ment had been done in which parity non-conservation could produce 
an asymmetry (Lee and Yang 1956). For example, in the above 
calculation, we did assume that we had oriented nuclei initially, 
and this is not easy to achieve experimentally. It is therefore 
important to be able to decide easily which experiments will show 
up an asymmetry due to parity non-conservation, and also which 
will give a crucial test of whether other symmetry properties like 
time-reversal or charge-conjugation apply. 

Let us consider the decay of a single nucleus. The quantities 
that can readily be measured are the initial direction of the angular 
momentum J, the momentum pe and spin (polarization) s of the 
electron, and the momentum p, of the neutrino. The last is obtained 
from the recoil momentum py of the nucleus by the conservation 
of momentum p, = —Py — Pe. The parities are 


| J and s — even; Pe and p, — odd | (33.14) 


1t may also be possible to obtain information about the final angular 
momentum of the nucleus from angular correlation measurements 
on a y-ray emitted after the B-decay, but we shall not consider 
that here. Let us denote the quantum mechanical operators corre- 
sponding to these and any other variables by P;, Py; with i, f = 1, 
2,3..., and denote their numerical values in a particular experi- 
ment by 7, p;. Here suffices + and f refer to measurements made 
before and after the decay. Furthermore let the initial state initial 
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be changed into tinal by the interaction Hint. Now by making the 
measurements ~; on the system, we pick out from initia: the 
component 

5( Ps — ps)pinitial 


which is an eigenfunction of P; with eigenvalue p;. Here 5 is the 
Dirac delta function, or rather a product of delta functions, one 
for each measured quantity. This state then decays into 


H in 8( Py — P+) initial, 


and by making the measurements p, we pick out the component 


8(P, — py) AH int8(P — Ps)pinitiar, 
which has the amplitude 


M = i P*inald(P, — pp) intd(Pi — ps) Pinitian dv 
(33.15a) 


in the final wave function. The probability of obtaining the set of 
measurements 74, p, is therefore proportional to |M|?; 


Prob(p,, Ps, Ps, -. -) oc |M|?. (33.15b) 


Let us further apply the inversion transformation J7 under the 
integral sign in (33.15a). This does not change the value of the 
total integral (lemma 2, appendix C). 


M= | IT ptinaS(ITP, — p,)(IT# snt)8(I1Ps ~— ps) ITinitiay Av. 
(33.16) 


Note that here p;, p; are just constant parameters in the integral 
which remain untransformed. For the delta functions we have 
6(P) = 6(—P) and hence 


S(T P, — pr) = 8(P, — Ipr) 


whether JIP, = Py or —Py. Although the p, are just numerical 
parameters, they do refer to physical quantities P, and we have 
written IIp, to denote +, according to whether ITP, = +-P,. 

If we now suppose that #int is a parity conserving interaction, 
TWH int = +H int, then Yanitial and Weinal have definite parities and 
(33.16) becomes 


M = | dinar (P, — py) nt 8(Pt — padynitiar do. 
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Comparison with (33.15) gives for the probabilities 


Prob(p;, Dg, Ps, - - -) = Prob(IIp,, ITp., Ip, . . .). 


(33.17) 


Physically this just means that we get the same results from two 
apparatuses which are inversion images of one another, which is 
just what one would expect from a parity conserving interaction. 
Returning now to the notation of (33.14), we can expand the 
probability (33.17) in terms of J, pe, 8, p, and this gives powers and 
combinations of the products 


Pe’P, J's, JI'peAD,, ete. (33.18a) 


J: Pe, J-p,, Pe ° 8, J:pe A 8, etc. 
(33.18b) 


From (33.14) and (33.17), if #int is parity conserving, then only 
the even parity combinations (33.18a) can occur in the transition 
probability. Thus to demonstrate parity non-conservation, it is 
necessary to measure a combination such as J and pe occurring in 
(33.18b), and to show that the probability distribution contains 
an odd power of cos 6 = J+ pe/|J||pel|, ic. that P(@) A P(w — 8). 
On the other hand measurements of pe and p, alone can never 
show up an asymmetry due to parity non-conservation, since we 
cannot make any odd-parity combination out of these two vectors. 


Forbidden transitions 


If in (33.4) we put ¢e(r) = ¥(0), ¥,(r) = ¥,(0), we obtain zero 
probability for many transitions. These are said to be forbidden, 
but because of the crudeness of our approximation they really 
have a small non-zero probability in general. Taking again plane- 
wave wave functions, we have for the electron-neutrino parts of 
(33.4) 


pe*(r)yp(r) = pe*(0)yy,(0) exp—i(Ke — k,) -r 


«2 t yt 
= (eS Xp ayy Yim(® PMlr/2)' Finn, 4] 
(33.19) 


where 1/A = |Ke — k,|, and 0, ® give the direction of k, — k,. 
The term in the last square bracket enters into the nuclear matrix 
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element. The Yim*( 0, ¢) transform according to D under spatial 
rotations and have parity (—1)!. Thus the selection rules become 


[Ji — Jt] <1 < Jy +I, (Fermi) 


or j’—1] <j < 141 with JJi— Jy] <7 < Ai +d), 
(Gamow-Teller) (33.20) 


and w, = (—1)! uw (ordinary transitions) 
wy = (—1)4! wu, (relativistic transitions). 


From (33.19) introducing the factor (r/A)! into the nuclear matrix 
element reduces it considerably because R/A < 1 where R is the 
radius of the nucleus. Consider for instance the decay 


Bel% = 0, T=1,w = +1) > BY =3, T =0,w = 41) 


among the levels of Fig. 44. From (33.20) the matrix element with 
lowest J is for an ordinary transition of the Gamov—Teller type with 
| = 2 given by the tensor and axial vector interactions. Since the 
initial and final configurations are (1s,/,)*(2p5/2)°, the terms 


| pp*as7 7° Vam(9, dn dv 


in the nuclear matrix element are non-zero without invoking con- 
figurational mixing, so that the transition can be regarded as 
favoured. Thus from (33.19) the matrix element is reduced by 
(R/A)!/(21 + 1)!! compared with an ordinary favoured allowed 
(1 = 0) transition. With R/A ~ 1.0 x 10-2 and | = 2, this gives 


logioft = 3-1 + 2 logy(15 x 104) = 13-5. 


The closeness of the agreement with the experimental value 13-7 
(Table 39) must be regarded as somewhat fortuitous. 


References 


For further details about beta decay the reader is referred to any 
text on nuclear physics, e.g. Bethe and Morrison (1956), Sachs 
(1953), Blatt and Weisskopf (1952). Siegbahn (1955) is an exhaustive 
handbook covering the situation prior to the discovery of parity 
non-conservation. The consequences of parity non-conservation are 
discussed by Lee and Yang (1956) and many papers since then. 


Summary 
Beta decays have been classified into allowed or forbidden, 
ordinary or relativistic, and favoured or unfavoured. Selection 
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rules have been calculated and transition probabilities estimated 
accordingly. The asymmetry in the motion of the emitted electron 
due to parity non-conservation of the interaction has been illustrated. 


PROBLEMS 


33.1 Recapitulate problem 32.6 of the previous section. 

33.2 Verify in detail that the f-decay interaction (33.4) is a 
scalar under proper Lorentz transformations. Also prove that any 
Lorentz invariant scalar interaction not involving derivatives must 
have that general form (33.4) if we always couple protons with 
neutrons, electrons with neutrinos. Hint: any 4 x 4 matrix can 
be written as a linear combination of the 16 products of I, matrices 
(§ 32) which form a reducible representation of the Lorentz group. 

33.3 Prove in detail the selection rules for ordinary relativistic 
transitions (33.8), (33.9) and for forbidden transitions (33.20). 

33.4 Classify the following transitions as allowed or “forbidden”’ 
of order J, ordinary or relativistic, favoured or unfavoured: Be’ 
—> Li’, Si31 + P31, N17 > O17, Mg?? > Al??. Use the shell model 
(§ 29) to determine the initial and final J and parity. Note that in 
an odd A nucleus, the nucleons almost always pair themselves so 
that the total J and parity is that of the last unpaired nucleon. 
It may be assumed that this rule applies to all the above nuclei. 

33.5* A given atom can decay either by the nucleus capturing a 
1s electron or by emitting a positron. Outline how you would 
calculate the ratio of the two probabilities, and discuss whether it 
will depend on the relative magnitude of the C’s in (33.4) (Blatt and 
Weisskopf 1952, p. 684; Bethe and Morrison 1956, p. 230). 

33.6 With the notation of § 31, 


b = Pyro + Pet_o + Pytto, + fatto 


is a spin 4 wave function. Show that there is nothing inconsistent 
in having , = cy,*, Ys = cf,*, c = +1 as regards transformation 
properties under proper Lorentz transformations and under space 
inversion. Express such a ¥ in terms of w,, u_, v,, v_ (31.23), and 
verify from (32.23), with charge put equal to zero, that the Dirac 
equation can have solutions of this type. Wave functions of this 
kind can be used to describe neutrinos in a “two-component” 
theory (cf. Case 1957). 

33.7 Complete the list of (33.18) up to all triple products and 
write down which ones are not invariant under time-reversal. 
Hence discuss possible experiments to test whether or not the 
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beta-decay interaction is invariant under time-reversal symmetry 
(Jackson e¢ al. 1957). 


34. Positronium 


A positron and an electron can form a bound system similar to a 
hydrogen atom, known as positronium. In this section we shall 
discuss the symmetries describing its different energy levels, and 
whether the system in a given state annihilates into two photons 
or into three. A thorough analysis of the interaction between 
electrons, positrons and electromagnetic radiation belongs to the 
realm of field theory (Jauch and Rohrlich 1955, p. 274), but it is 
possible to point out all the (known) symmetries in the non- 
relativistic limit without such an amount of elaborate apparatus. 
Only an occasional reference to the work of §§ 31 and 32 is necessary. 


Symmetry transformations 


Let us consider two charges e, and é, each of which for the present 


may be an electron or a positron. The Hamiltonian for their inter- 
action is 


h? h? e4e. 
=—_— ——_— } rere aires 2 12 
KH = Im V1 om V2 + Th + # spin, (34.1) 
where the spin-dependent interaction is 
Poin = —£1% (fy — 2) A (Vi — ve) (1 — 12) A Vy . 
cacti al 125 275% “1 


_ &1€a] (2 — 11) A (Y2—Vi) (fz — 1) AY, gual: 
me? 11428 2r,.8 7 


C162 }S1 "S82 38, - (F, — £1)82* (tz — 14) 
: mal 12° a ee 


This may be derived in the same way as the spin-orbit coupling 
in § 11 (Heisenberg 1926). 

The Hamiltonian (34.1), and indeed any Hamiltonian describing 
only electrons, positrons and radiation, is invariant under space 
inversion, time-reversal, charge conjugation and permutations. 
Time-reversal will not interest us in this section. The charge con- 
jugation operation I replaces each electron by a positron and vice 
versa, i.e. Te, = —e;, and this is seen to leave # invariant. Now 
the state of our system depends on whether each particle is an 
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electron or a positron, so that we include e, and e, explicitly in the 
wave function and writet 


Fb, Te, 01, Fa, C1) 2) = P(T1, Tar Fy, T2, —€, —Cg). (34.3) 


The Hamiltonian is also invariant under the space inversion II 
(32.16). The effect of IZ on the orbital variables is to replace ry 
by —ri, but we must also consider its effect on the spin functions. 
We saw in § 32 that in relativistic quantum theory the wave function 
of an electron can be written 


ru, + viru + $04 + Ys_v_, (34.4) 


where u,, u_ and v,, v_ transform according to D“/® under pure 
rotations. For an electron with a small energy the component 
wi,, #1. are large of order unity, and s,, %s_ are very small of 
order v/c. For a state describing what is called a positron the com- 
ponents in v,, v_ are the large ones (see equations (32.23)). Hence 
as in Chapter I, we shall in our non-relativistic discussion neglect 
the small components, and write electron and positron wave func- 
tions purely in terms of u,, u_ and v,, v_. In § 11 the convention 
was made that u,, w_ are invariant under J7, and in § 31 it was 
shown that therefore v,, v_ change sign under JI (cf. equation 
(31.24))f 

Ily, = —,, Ilv_ = —v_. (34.5) 


Thus if % describes a state with p positrons (p = 0, 1 or 2 in our 
case), it will contain products of p spin function v,4, and from (34.5) 


TTb(t 1, Ley Fy, Fg) C1» 2) = (—1)?f(—1y, —Te, 94, Fas C1, a). 
(34.6) 


+ This charge conjugation operator is the same as that defined by (32.28). 
It leaves the spin o of each particle invariant (problem 32.8). There is no 
complex conjugation in (34.3) because there are two particles and the complex 
conjugation of (32.28) comes in twice. 

¢ This can also be proved more directly as follows. Since IT commutes with 
the full rotation group, by Schur’s lemma (appendix D) it is represented 
by A,# and A,# with respect to (u,, wu.) and v1, v_), where each A = +1 or 
+4 since 17? = +E. Hence with respect to (u,, u_, v4, v_) IT is represented 
by the diagonal matrices 

(+1 or +4) diag [1, 1, 1, 1) (A) 

or (+1 or +7) diag [1, 1, —1, —1]. (B) 
The other combinations of A,, A, contradict 7? = +H. If L(v) is a Lorentz 
transformation with velocity v, then IZL(v) = L(—v)iZ. Writing Z(+v) 
= exp(+76I) as in (31.5) shows that the matrices representing L(v) and 
L({—v) are not equal so that IT cannot have the form (A). In accordance with 
the convention of chapter II and of (31.24), we choose +1 in (B), which 
proves (34.5). 
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The Hamiltonian (34.1) is further invariant under the permutation 
operation P, 


Pr, = Yo, Po, = Oe, Pe, = €e; Pr, = Yr, Po, = G4, Pe, = ey. 


There are only two particles so that the eigenfunctions of (34.1) 
can be sorted out to be symmetric or antisymmetric with respect 
to P (§ 7). If the particles are both electrons or both positrons, 
the wave function must be antisymmetric according to the usual 
statement of the exclusion principle (§12). In fact % must still be 
antisymmetric if it describes an electron, positron pair. This is not 
obvious in a purely non-relativistic discussion in which the electron 
and positron are being treated almost as unrelated particles. How- 
ever it follows in a relativistic theory because the difference between 
an electron and a positron function is a quantitative one in the 
relative sizes of the four components in (34.4). Thus whatever 
e, and é, are 


PY(Ty, Pes 94, Fa Cy, a) = We, Fy, G2, 04, Cp, ey) 
= —y(ry, Te, 04, Fg, &%, C2) (34.7) 
Now (34.1) can be expressed in terms of the position f(r, + r.) 
and velocity 3(v, + Vv.) of the centre of mass, and the relative 
position (r, — r,) and velocity (v, — v,) of the two particles. Let 
us assume from now on that the centre of mass is at rest. Every- 
thing then depends only on r, — r, as regards space variables and 
Y(T1, Be, Oy, On, Cy, &g) = f(r, — rp) 
= o(—Te, —Ty, 01, Op, €, C2). (34.8) 
Also the first two terms of (34.2) reduce to 


3 ee. (ry — 2) A (v, — ve) 
aa + 8,), 


and # (34.1) becomes invariant under the spin interchanges trans- 
formation &, 


D> vi =T7y, Dey = &, do, = 02, Loy = 01, 
2 (Py, Te, 01, Ga, Cy, Co) = WT, Te, Oo, 04; €1, €g). (34.9) 


Symmetry classification of the eigenstates 


The transformations I, JT, P, X all commute with one another 
and with the full rotation group. Since we have an even number 
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of particles, we shall only require the single-valued irreducible 
representations D) of the rotation group, and it follows from the 
general theory of § 15 that each D“ gives rise to 24 irreducible 
representations of the complete symmetry group of (34.1) by 
associating D') with a set of characters 


such that the matrix y(«)D“(R) represents the transformation «R 
where F is a proper rotation and « = I, IJ, P, X or any combina- 
tion of these. Because of the exclusion principle we have already 
restricted ourselves to the antisymmetrical representations with 
x(P) = —1. The fact that we wish to describe positronium con- 
sisting of one electron and one positron gives the further restrictions 


€; = —Cy, (34.102) 


p=l. (34.10b) 
It follows that 


STIp(r,, Yo, Gy, Oo ey; €5) 
= (—1)?4(—r,, —ro, Og, Oy, —€1, —Eg) 
from (34.3), (34.6), (34.9), 
= U(—Fe, —F4, 01, G2, —€2, —€,) from (34.7), (34.10b), 
= Y(11, Ty, 1, Fg, €1, €g) from (34.8), (34.10a). 
Hence 
x(2)x(L)x(F) = 1. (34.11) 


As in the theory of atomic spectra, # spin in (34.1) is relatively 
small and can be neglected in a first approximation. # is then 
invariant under separate rotations of the orbital variable r, — r, 
and of the spin co-ordinates, giving a configuration with degenerate 
wave functions transforming according to 


DY x D5), where S = 0 or 1. (34.12) 
Any S = 0 wave function contains the spin function 
ULV — UDs 


as a factor so that it is antisymmetric under the spin interchange 
2, and similarly the S = 1 ones are symmetric. 


x(2) = —lforS=0, x(Z)=1forS=1. (34.13) 
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The orbital parts of the wave functions contain the spherical har- 
monics and are multiplied by (—1)! under inversion, so that in 
view of (34.6) and (34.10b) we have 


x(IT) = (—1)44, (34.14) 
Now from (34.11) 
x(L) = (—1)4$ (34.15) 


Thus the symmetry properties of one configuration are completely 
determined by / and S with the help of (34.13), (34.14), (34.15). 
H spin is now considered as a perturbation, and this splits every 
configuration into terms characterized by a J quantum number. 
The configurational set of wave functions transforming according 
to (34.12) decomposes into several term sets transforming according 
to D”) where the values of J are given by the vector coupling 


TABLE 40 
Energy Levels of Positronium 


Level l 8 J x(2) x(P) x(Z7) x(L) 
19 0 0 0 —1 ai 1] a 
1P, 1 0 1 =| 2] 4 i} 
1D, 2 0 2 =A -1 ae | +1 
aS, 0 1 1 1 =| am —1 
yo 1 1 0 1 oy Ey +1 
sp, 1 1 1 1 = +1 +1 
°P, 1 1 2 1 = +1 +1 


eee 


rule (9.2) as usual. In this way we obtain energy levels (terms) 
such as are shown in the usual spectroscopic notation (Chapter IT) 
in Table 40. The table also shows the characters of Py 2 ATP om 
determined by (34.7) and (34.13) to (34.15). 


Transition selection rules 


If an atom makes a transition from one state to another with the 
emission of a photon, this transition is accompanied by a change 
in the charge distribution in the atom. In a semi-classical discussion 
of radiative transitions (Schiff 1955, p. 260), the current associated 
with this shift in the charge is considered to radiate the electro- 
magnetic wave corresponding to the photon. If now the current 
is reversed corresponding to charge conjugation, it follows from 
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Maxwell’s equations that all the fields are also reversed. Thus we 


can write 
I'¢(r) = — ¢(r), 


where ¢(r) is a state function describing the electromagnetic field 
of one photon. Two crossed beams of light are known not to inter- 
act appreciably with one another, so that a multi-photon state is 
just a product of single free photon states, 


D = $,(1) dalta)ha(Ps) « - - n(n) 
and ré =(—1)", —_ x() = (— 1)", (34.16) 


where 7 is the number of photons. 
The Hamiltonian for a free electromagnetic field is quadratic 
in the field strength 


# =< | H+ ee, 
Sir 


and the interaction term between electrons and the field has been 
discussed in § 32. We see therefore that the total Hamiltonian 
# tor for electrons, positrons and radiation is invariant under I. 
If pto¢ is a total wave function, we have 


T(H tovfrot) = Htor(TYtot)- 


Therefore I, considered as a quantum mechanical operator, com- 
mutes with the Hamiltonian and is a constant of the motion. This 
means that its expectation value x(I") remains constant during any 
interaction process (see equation (17.1) or the fundamental theorem 
of § 30). In particular if the electron and positron in positronium 
annihilate into n photons, we must have from (34.16) 


(—1)" = xP) 
where x;(I") refers to the initial positronium state. For instance 
from Table 40, the ground state 18» annihilates into an even number 
of photons, usually two. Similarly the °S, state annihilates into an 


odd number of photons, the minimum number being three since the 
total momentum has also to be conserved. 


PROBLEMS 


34.1 Derive the selection rules for dipole transitions between 
the levels of positronium (Jauch and Rohrlich 1955, p. 279). 

34.2 Write down for a system of two electrons the relations 
which correspond to (34.10), (34.13), (34.14). Show that there is 
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no relation directly analogous to (34.11), but instead y(I7)y(Z) 
= x(P) = —1, and construct a table of levels similar to Table 40. 
Hence show that the exclusion principle in this case excludes some 
of the terms that are allowed in Table 40, and illustrate this by 
writing down the form of some of the wave functions or by con- 
sidering Slater’s scheme (§ 12). 

34.3 Show that the quantities e,, e, in (34.1) may be regarded 
as vectors in a two dimensional pseudo spin space, and hence that 
the energy levels may be described by an additional pseudo spin 
quantum number 7'=0 or 1. Using this formalism discuss the 
difference between the symmetry properties of states of positronium 
and of two electrons as noted in problem 34.2. Also write down the 
annihilation selection rules in this general formalism. 

34.4* Write down in the notation of field theory the relations 
that correspond to equations (34.3), (34.6), (34.7), (34.8), (34.9), 
(34.10), and hence derive (34.11), (34.13), (34.14) (Jauch and Rohr- 
lich 1955, p. 274). Note in particular how the commutation relations 
in field theory replace the exclusion principle, i.e. antisymmetry of 
y in the wave function formalism. 


Appendix A 


Matrix Algebra 
The following gives the amount of matrix algebra assumed in the 
text. On some occasions when a more advanced result or concept 


is required, references are given to Margenau and Murphy (1943, 
Chapter 10). 


Definition of a matrix 


Consider the linear transformation of co-ordinates from (z, y, 2) 
to (X, Y, Z) 
a= AyX + ApY + A,5Z, 
yY = Ayn X + AnY + Ags, 


This system of equations is one linear transformation and it is 
also often convenient to treat the coefficients A,,, Ay, . . . Aas 


as a whole. The square array of coefficients 


AAAs Ay AjAis 
Ag AeA os or AoA yA og 
Ay AAg, Ay, As,A og 


is the square matrix A, the brackets indicating that the matrix 
refers to the array as a whole and incidentally not to the deter- 
minant of the coefficients. More generally, it is convenient to 
consider rectangular matrices with m rows and n columns, these 
then being of order m x n (pronounced m by n). Ay is the (ij) 
element of A, where ¢ is the number of the row and j the number of 
the column in which Ag is situated. Sometimes A is referred to as 
“the matrix A;;”, meaning the matrix whose (i) element is Ajj. 
In general it is not necessary for matrices to arise only in connection 
with linear transformations in the way they have been introduced 
here, but they frequently do either directly or indirectly. 


Multiplication 
Consider another linear transformation 
X = Byé + Byoyn + Bysf, 
Y = Byué + Boon + Bgl, 
Z = Buk + Boon + Bas6. (A.2) 
404 
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Suppose we have a function of z, y, z and we wish to express it first 
in terms of X, Y, Z and then in terms of é, 7, ¢. We would first 
substitute for x, y, z using (A.1) and then use (A.2). If, however, 
we are only interested in the final result, we can shorten the cal- 
culation by eliminating X, Y, Z between (A.1) and (A.2) and 
using directly the resulting transformation 


© = (Ay By + Ay Bo + ByAgi)é + (Ay, Byy + Ay,Boo + 
sf A13B30)n “fe (41, Bis ae A1.Bos oF Aj, B53)f, 
¥ = (Ay By + AgBy + Ag, Ba)éE + (Ag By + Ag Boy + 
=f Ay3.Bye)n =f (Ag, Bis a Ao. Bog i As,Bgs)C, 
z= (4,By+ Ax.Bo, + AsgBa)é + (Ag, By, + A3.Boo + 
+ As,By9)n + (45, Big + Ase Bog + Ags Boa)f, 
(A.3) 
which is a combination of (A.1) and (A.2). Denoting the coefficients 
in (A.3) by 


3 
Cy = 2 AipBx;, (A.4) 
=1 


we would like to express the relationship of the matrix Cy to the 
matrices A and B. This is accomplished by defining C to be the 
matrix product 

C=AB (A.5) 


of A and B in this order, where {A.4) is the rule for finding the 
elements of C from those of A and B. More generally, if A is a 
matrix of order m x p and B is of order p Xn, the product C 
= AB is defined as the matrix Cy of order m x n where 


Dp 
Cy = > Ag By. 
k=1 


The order of multiplication is important, and the product D = BA 
is a different matrix from C. 


Dy = > BurAry. (A.7) 


(A.6) 


In fact unless m = n it is impossible to carry out the summation 
in (A.7), and even then we see from (A.6) and (A.7) that Dy 4 Cy. 
Also (A.6) and (A.7) are similar to the laws for multiplying deter- 
minants, so that if Aand Bare Square matrices of the same order, then 

|AB| = |BA| = |A]|BI, (A.8) 
where |4 B| is the determinant of the matrix AB. 
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With this definition of matrix multiplication, the transformation 
(A.1) can also be written completely in matrix form. Let g be the 


matrix 
11 
q=|y)- (A.9) 
r4 


Such a matrix with one column is called a column matrix (or 
column vector) and is often written {xyz} to save space. Similarly 
al X n matrix is a row matrix. In this notation (A.1) becomes 


gq = AQ (A.10) 
where Q = {XYZ}. Similarly (A.2) can be written 

Q = By (A.11) 
where v = {én}. Then (A.3) becomes 

gq = ABy, (A.12) 


and this result also follows directly by substituting (A.11) in (A.10). 


The double suffix summation convention, also known as the 
dummy suffix notation. 


Although it is often convenient to write matrix products in the 
compact form (A.5), it is on other occasions useful to work in terms 
of the elements as in (A.6). Then summations over some of the 
suffices appear constantly, and it is customary to omit the sum- 
mation signs but to consider any product to be automatically 
summed over every suffix such as & in (A.6) that occurs twice in 
one product. With this convention (A.4) and (A.6) become 


Cy = AnBry- (A.13) 


Before (A.1) or (A.10) can be written in this form, it is necessary 
to alter the notation slightly and to put 


h=% B=y% GW=% a=, ete (Als) 
Then (A.1) and (A.10), (A.2) and (A.11) become 
qi = AyQ;, Q = Brave 
whence by substituting for Q; we have immediately 
a: = AgByxvy = Cyr, from (A.13), 
in agreement with (A.3) and (A.12). Incidentally (A.13) can equally 
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well be written Cy = AyBy or Cyy = Agi By or in any other equiva- 
lent form which retains the relationship between the suffices. In 
particular a repeated suffix can be called anything at all, since it 
is being summed over and thus would not appear in the relation 
if it were written out in full. Such changes are frequently necessary 
in making substitutions, to ensure that the same letter is not being 
used for indices that are not necessarily equal and that should not 
be summed. The particular symbol 


64 = 1 fort =j, 
A. 
| nee ev) 


is often used. For instance 8;A;, = Ag. It has the effect of re- 
placing the suffix 7 by ¢ or vice versa. 


Derived matrices and special matrices 


Starting with a matrix A of order m x n we can construct another 
matrix A of order n x m called the transpose of A, such that the 
(ij)*® element of A is given by 


If A is a square matrix of order n x n and the determinant 
| A| 40, it is possible to construct its reciprocal or inverse A-) 
also of order n X n with the property 


| AA1=A7IA=EH# (A.17) 


where FE is the unit matriz (which is always square) of order » with 


elements 


i.e. H has the form 


] 0 0 0 
0 | 0 0 
0 0 1 0 
0 0 0 1 


27 
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It is possible to express the elements of A-! in terms of the Ay 
but for the present purposes it is immaterial to know what they 
are. From (A.17) and (A.8) 

1 


[47 | = 4)’ (A.19) 


which shows why it is not possible to construct A-! if |A| = 0. 
A unitary matrix U is a square matrix with the property 


U-1 = 0* (A.20) 
where * denotes taking the complex conjugate of every element. 


Thus from (A.17) 
U*U = U0* =E (A.21) 


|U||o|* =1. (A.22) 


The zero matriz 0 is a matrix of any order with all elements zero. 


and from (A.8) 


PROBLEMS 


A.1 Write down some matrices with simple numbers as elements 
and find their products. With square matrices check your result 
using (A.8), and also verify that in general AB 4 BA. 

A.2 Write down the (#j)*® element of each of the matrices 
D, = A(BC) and D, = (AB)C using the double suffix summation 
convention, and hence show that D, and D, are the same. Also 
write equations (A.17), (A.20) and (A.21) in terms of the elements 
in this way. cas 

A.3 Using the summation convention show that (AB) = BA 
and that (4) = A. 

A.4 Show that if A is a matrix of order m x n and Em, Ex 
are respectively unit matrices of order m x m and n X n, then 


AE, = EnA =A. 


A.5 By matrix multiplication show that (4B)-! = B-14- 
and that (A-1)-! = A. 

A.6 Show using (A.21) that the transformation matrix R of 
the transformation (2.2) is unitary. What are |R| and R-12 With 
the notation of (A.10) if g = RQ, show by multiplying by R-! 
that Q = R-q and verify this by solving the three simultaneous 
equations (2.2) for X, Y and Z in terms of 2, y, z. 

A.7 If B= U*AU where U is unitary, show that |B| = |A| 
and By = Ag (summed)}). 
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A.8 Show that the product of any number of unitary matrices 
is also unitary. 

AQ If g = AQ in the notation of equation (A.10), show using 
the result of problem A.3 that 


Q*A*AQ = G*q (A.23) 


and that each side of this equation is a one by one matrix, ie. a 
single number. Show further that if A is unitary, then (A.23) when 
multiplied out becomes 


x? + y? +22= X24 y2 + Z%. (A.24) 


Conversely, show that if equation (A.24) is to hold, then 4*A must 
be the unit matrix Z, ie. A is unitary. This is in a simplified form 
the most important property of unitary matrices (cf. appendix 
C). Since the Hamiltonian of almost every conceivable physical 
system can be written so as to involve r or ry (cf. § 3), it follows 
at once that the symmetry transformations of such a Hamiltonian 
are unitary transformations. 

A.10 The functions ¢;, 1 =1 to n, are normalized and ortho- 
gonal. The n functions $; = Ay d; (summed) are also normalized 
and orthogonal to one another. Prove that the matrix A is unitary. 


Appendix B 


Homomorphism and Isomorphism 


Definition. Two groups G, and &, are homomorphict if some 
element of ®, can be associated with every element of &,, such that uf 
P,Q, = R, then P,Q. = R,, where Ps, Q2, R, are the elements of 
©, that correspond to the elements P,, Q,, R, of Gy. 

Definition. Two groups G, and ©, are isomorphic if there 
exists a one-to-one correspondence between the elements A,, B,, Cy, ... 
of ©, and A,, By, Cy, .. . of Gy, such that if P,Q, = R, then PQ, 
= R, and vice versa. A relationship between the elements of ©, 
and @, is said to be a one-to-one correspondence if the elements of 
G, and G, can be paired off together, A, and A,, B, and B,, etc., 
in such a way that each element of each group is paired with one 
and only one element of the other group. If the groups are finite 
this implies that they have the same number of elements. From 
the definitions, an isomorphism is automatically a homomorphism, 
and is in fact a special symmetrical kind of homomorphism which 
can be described approximately by saying that two isomorphic 
groups have exactly the same structure as regards the relationships 
of the elements in each group to one another. 

The rotations (4.1), the linear transformations (4.13), the matrices 
of the coefficients in these transformations (the law of combina- 
tion in this case being matrix multiplication in the order FS = C, 
cf. equation (4.19)), and the permutations (4.22) form groups which 
are all isomorphic with one another. This follows from the discussion 
of § 4 which establishes a one-to-one correspondence between 
elements of any two of these groups. As a first example of a homo- 
morphism, consider the point-group 32 of rotations #, A, B, K, 
L, M (equation (4.1)) as G,, and the group g = (e, a, b, k, l, m) 
as ©, where e=a=b=1 and k=l=m=-—1. The fact 
that the six elements of g are not all different from one another 
does not violate any of the group properties of § 4.{ Now if we 

+ Some authors denote by simple isomorphism and by isomorphism (or 
multiple isomorphism) what we call isomorphism and homomorphism res- 
pectively. 

t Some authors adopt a more restrictive definition of a group than that 
given in § 4, by including the further condition that all the elements be distinct 
(Ledermann 1953, pp. 2, 3). Then the matrices of a representation do not 
always form a group, which would be inconvenient for the purposes of this 
book. 
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have a relation such as KA = L (4.5), then ka =1 also holds as 
can easily be verified from the multiplication table of G, (Table 1), 
so that the groups are homomorphic. However, since | = m, we 
also have ka =m although KA ~M. Thus the “vice versa” 
condition of the definition of isomorphism is not satisfied. Our 
second example of homomorphism is closely related to the first. 
Consider the group of rotations 32 again as ©, and the group 
g’ = (1, —1) as G,. If we associate the element 1 of 9’ = G, with 
each of the elements HZ, A, B of G, and —1 with the elements 
K, L, M, then as before S,F, =C, always implies S,F, = C, 
so that the groups are homomorphic. However, the “vice versa” 
is again not satisfied and in addition the number of group elements 
is unequal so that the groups are not isomorphic. As a third example 
consider the roles of the groups 32 and g’ reversed, i.e. g’ as G, 
and 32 as ©,. Then we can associate H with +1 and K with —1, 
in which case S,F, = C, always implies 8,F, = C, and vice versa, 
so that the groups are again at least homomorphic. However, they 
are still not isomorphic because although the ‘“‘vice versa” condition 
is satisfied, not all the elements of G, have been paired off with 
elements of G,. 


PROBLEMS 


B.1 Construct some other examples of isomorphism and 
homomorphism. 

B.2 The matrices A, B, .. . form a group @. Show that the 
determinants |A|, |B, . . . form a group which is homomorphic 
but not necessarily isomorphic with G. 

B.3 Try to express in your own words the additional condition 
that a homomorphism must satisfy for it to be an isomorphism, and 
test your wording on the examples given above. 

B.4 Compare the definitions of homomorphism and isomorphism 
in some of the texts on group theory given in the bibliography, 
testing them on the examples given above. 


Appendix C 


Theorems on Vector Spaces and Group Representations 


In § 5 the results of certain theorems are appealed to, and these 
will now be proved. It should be noted that the following (rather 
clumsy) proofs have been produced only to show that the argument 
of the main text can be justified step by step. They are therefore 
as elementary as possible, involving only the minimum number of 
intermediate concepts and results. For much more comprehensive 
and elegant developments of the theory of representations, the 
reader is referred to the standard texts mentioned in the biblio- 
graphy, e.g. Van der Waerden 1932, Wigner 1931, Speiser 1937, 
Wey! 1931. 

Theorem 1. The dimension of a vector space is unique. Let 4, 
j =1 to n, be n linearly independent base vectors spanning the 
space R(¢d, . . . dn). Further let ¢';, 1 = 1 to m, be another set of 
m linearly independent vectors spanning exactly the same space X. 
It is required to prove that m = n. 

Suppose m < n. Since the 4 span the space R, we can express 
the ¢; in terms of the ¢’; and we have the n relations 


$i = Pyd's (C.1) 


where the Py; are some coefficients. Let P be the square matrix 
of order m x m formed from the first m rows of the coefficients 
Py, 4 = 1 to m, j = 1 to m. Since the ¢; are linearly independent, 
it is impossible to find m coefficients «; such that 


ay Prey = 0, (C.2) 


as otherwise we would have asd; = ajPij¢'5 = 0. Since (C.2) has 
only the solution «; = 0, all j, we have det|P| #0, and hence 
can solve the first m equations of (C.1) and obtain 


$'5 = (Pade. 


This result can be substituted into the (m + 1) equation of the 
set (C.1), which then gives a linear relation among the ¢; contrary 
to the hypothesis that they are linearly independent. Therefore 
m <n, and similarly n « m. Therefore m =n, which proves the 
theorem. 

Lemma 1. Jf & is a vector space and t a subspace of RB, then tt 
ts possible to construct a unique vector space s such that s is orthogonal 
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to t and R=r+s. Let the orthogonal, linearly independent 
functions ¢,, . . . ¢¢ span the space r. By taking any functions of 
R, it is possible to define further base vectors in R, and by the 
process of § 5 to make them orthogonal to the ¢,,... ¢, and to 
one another. Let this give the further base vectors brit, -- » On, 
where v is the dimension of 8. Now the vector spaces = (¢r44,... 
¢n) is orthogonal to r (ie. any function of s is orthogonal to 
any function of r). Further any function ¢ of R can be written 


n r 
¢ = > caps, i.e. in the form 6 = g + g® where g) = > c¢¢ is some 
1 I 


n 
function belonging to r and 4‘) = > cp¢ some function belonging 
1 


r+ 
tos. Hence 8 =r +s. Moreover, the splitting up of ¢ into ¢™ 
and ¢) is unique if $' has to be orthogonal to 4, so that s is 
unique. This proves the lemma. 

Lemma 2. All symmetry transformations (except Lorentz trans- 
formations) leave the scalar product of two Junctions invariant. We 
shall first prove the lemma for a rotation R. Let f be a function 
of x, y, 2, and let us for the present consider R as a change to a new 
co-ordinate system X, ¥, Z in the sense of §§ 2 and 3. Then f can 
be expressed in terms of X, Y, Z, say f(x, y, z) = F(X, Y, Z). 
The volume element dv can be written as either dx dy dz or dX d¥ dZ 
(Margenau and Murphy 1943, p. 190), and thus if the integrations 
are carried out over all space, 


| f (@, y, 2) dx dy dz = [ F(X, ¥,Z)dXaYaz. (C3) 


Physically the reason for this equality is that we have not changed 
the value of the integrand at any point P, but simply referred to 
the same point P by two different labels (x, y, z) and (X, Y, Z). 
If we put f = ¢*% where ¢ and y are two functions, and interpret 
(C.3) in terms of the notation of equation (5.3), we obtain 


J $b de dy de = f (RAy*(RY) de dy de. 


Thus the scalar product of ¢ and remains invariant under the 
transformation R. 

The same proof also applies to any transformation for which the 
volume elements are equal, dg, dg. . . . ddsn = 4Q: dQ: .. . dQsn 
in the notation of equation (4.15), and this is clearly true for the 
other types of symmetry transformation that arise namely permuta- 
tions, the inversion and reflections, though not for a Lorentz trans- 
formation (problem C.1). This establishes the lemma. Incidentally 
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the reader should have no difficulty now in writing a general proof 
of the lemma using the above type of argument and the hint con- 
tained in problem C.3. Transformations which leave the scalar 
product of two functions invariant are called untiary transformations 
(cf. problem C.2), and it will be implicitly assumed in this appendix 
that all transformations mentioned are unitary. 

Corollary to lemma 2. If the functions ¢ and y are orthogonal, 
then so are T'¢ and Ty where T is a unitary transformation. 

Lemma 3. If the vector space R =t +5, and & and t are each 
invariant under a group & of (unitary) transformations, then $ is 
also invariant under ©. Let ™ and ¢) be any two functions 
belonging to r and s, and let 4, = 7-14 where TJ is a trans- 
formation of &. Since 7'-! also belongs to G, ¢,™ belongs to rt. 
Therefore ¢ is orthogonal to ¢,", and by the corollary to lemma 
2, T¢* is orthogonal to 74, = ¢. Since XR is invariant under 
&, 7 belongs to R. We therefore have that 7'$%) belongs to R 
but is orthogonal to any function ¢ of r. Hence 74) belongs to 
s, and s is invariant under ©, which proves the lemma. 

Theorem 2. If ¥& is a vector space invariant under a group ® 
of transformations, then & 1s either irreducible and contains no 
invariant subspace, or & 1s reducible into a sum of orthogonal invariant 
subspaces. We shall first point out just what there is to be proved. 
In § 5 we defined and investigated the consequences of having an 
irreducible or reducible vector space, and showed that a reducible 
space can be split up into a sum of invariant subspaces. Proceeding 
in this way we were not able to prove that the representation can 
always be reduced in such a way that the invariant subspaces are 
orthogonal to one another. For even if R had been referred to 
orthogonal base vectors ¢; initially, this is no guarantee that after 
the equivalence transformation (5.15) the base vectors ¢'; of the 
reduced space would be orthogonal. The present theorem shows 
that the invariant subspaces can be made orthogonal to one another. 
We were also unable to prove in § 5 that the following situation 
never in fact arises. Consider & transforming according to the 
representation D. Suppose that D is irreducible, but by choosing 
new base vectors ¢% in R and applying the equivalence trans- 
formation (5.15) it is possible to bring the matrices of D halfway 
towards reduced form, i.e. into the form 


ie T) Q(T) 
0 Dy (T)y 


where @jj(7') is not zero and Dy")(T) of order n, x n,. Then the 
first n, base vectors ¢’, ...¢'n, transform according to the 
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representation Dy')(T') and form an invariant subspace. However, 
the remaining ¢’; do not transform according to Dy'®(T) because 
Qy(7) is not zero. Thus we would have an irreducible vector space 
containing an invariant subspace. This situation does not corre- 
spond to either of the two possibilities allowed by the theorem and 
cannot in fact arise. 

From § 5, a reducible vector space always contains an invariant 
subspace. Hence a space that does not contain an invariant sub- 
space is irreducible. Now consider on the other hand that & does 
contain an invariant subspace r. Then by lemma 3 it also contains 
another invariant subspace s orthogonal to r and such that R 
=t+s. Let¢’,,...¢’,and ¢’,,;,...¢', be two sets of base vectors 
spanning t ands. Then the transformation in R to the base vectors 
?';,.-- $'n is an equivalence transformation of type (5.15) which 
brings all the matrices of the representation D into reduced form. 
The process can then be continued until none of the subspaces can 
be reduced further. We have therefore shown that if a vector 
space & does not contain an invariant subspace, then it is irreducible; 
also if it does contain an invariant subspace, then it is reducible 
into a series of orthogonal, invariant subspaces. Since R must 
either contain or not contain an invariant subspace, this exhausts 
all the possibilities, which proves the theorem. 

Schur’s lemma fits into the logical sequence of steps at this 
stage. In view of the importance of this lemma, its proof has been 
placed in a separate appendix (appendix D). 

Lemma 4. If a reducible representation D of a group © is reduced 
tnto irreducible components D one way 


D= DY + DVW+ ... 4+ DY, 
and also in another way 
D= DO 4+ DA +... 4+ De, 


then D‘) ta equivalent to one of the representations D, .. . Di), 
Let 8 be a vector space transforming according to D. Let the base 
vectors dj"), bj(?, 2. dy", i = 1 tom, j =1tom,...,k=1 
to my in R transform according to D™, D®, ..., D™ and similarly 
let the vectors ¢,"), 9), .. . dy?) in R transform according to 
D®, D'®),... DD”), Each of these two sets of vectors is linearly 
independent in accordance with an equivalence transformation of the 
form (5.15) with det| P| + 0, so that none of the vectors are zero. The 
¢;‘*' can be expressed in terms of the base vectors ¢;'), . . . dy”; thus 


dy) = Py + Py dhe Ho .. Py hi, 
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If T is any transformation of G, 7'¢;‘*) can now be written in two 
ways, 


To) = Py Dy TL) dg +... 
and T¢; = Dy@(T) di 
= Dy (LT) Pde +... 


Hence each of the matrices P'*) satisfies the conditions for Schur’s 
lemma (appendix D) and is therefore zero unless D) is equivalent 
to D‘), Since the ¢;~) are not zero, D is equivalent to at least 
one of the representations D), ... D'"), which proves the lemma. 

Lemma 5. Functions transforming according to different irreducible 
representations of a group are orthogonal. Let ¢;") (¢ = 1 to n) be 
a set of normalized orthogonal functions and ¢,;'*) another such set, 
transforming respectively according to the non-equivalent irreducible 
representations D® and D, Take any functions and orthogonalize 
them to the ¢;" and to one another by the process described in 
§ 5 to give a set $44, On42) -.- Which together with the ¢;'" form a 
complete set. Then 


n cO 
dy = > Pudi? + > Parr. 
1 n+l 


As in lemma 4, Py =0 since D®@ and D® are not equivalent. 
But Py is just f ¢;'*¢, dv integrated over all space, so that ¢,')) 
and ¢;‘) are orthogonal for all + and j. 

Theorem 3. The reduction of a reducible representation is unique, 
apart from equivalence. In the notation of lemma 4, it is required to 
prove that the representations D™, D@®, ... D' and D@, D®, . , 


D) are pairwise equivalent. From lemma 4, each D®, ... Di) 
is equivalent to one of the D™, .. . D‘”, and this proves the unique- 
ness if none of the D™,... D‘r) is equivalent to another. 


Consider now the following case in which this is not so. Let D@ 
be equivalent to D™ and D‘?) but not to any of the D®, ... Di’, 
and let us make a trivial change of base vectors such that these 
representations become identical. Then in the notation of lemma 
4, from Schur’s lemma (appendix D) P!) =ak, P’) = bE, Pt) 
= PY =... = Pl) = 0, ie. Jy = ade + bd,'?). Now construct 
fy! = bd," — ad,'). Then the vector space 


(EO. DEG. ODE. Hb. .) (C4) 


is orthogonal to (. . .¢;'*). . .) by lemma 5 and by construction. 
Similarly from the ¢;, .. . ¢4) we can construct functions 
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vs‘), . . . yx") transforming according to the same representations 
D‘), . . . D) but orthogonal to the ¢;'*). Hence the vector space 


i ce ee ee Se 2 a 9 


is also orthogonal to (... ¢;).. .), and hence by the uniqueness part 
of lemma I equal to the space (C.4). Hence 


DD) +4 Ye 4... + Di = Di 4. pm +...+ De), 


The process can now be repeated until all the component representa- 
tions have been paired off. The proof can also be extended easily 
to the case where any number of the irreducible components are 
equivalent, which proves the theorem. 


PROBLEMS 


C.l1 Show that the inversion transformation (3.11) and the 
permutation of two co-ordinates (3.10) are unitary transformations. 
If two transformations S and 7 are each unitary, show that the 
combined transformations 7'S and ST are also unitary. Hence 
show that all symmetry transformations mentioned in §§ 3 and 4 
are unitary. 

C.2 A unitary transformation 7 operating on a set of normalized 
orthogonal functions ¢; induces the transformations T¢; = Dy(T) d+. 
Show that D(T’) is a unitary matrix (appendix A). 

C.3 Give a general proof of lemma 2 using problem A.9 (appen- 
dix A) and the form of the volume element in general co-ordinates 
(Margenau and Murphy 1943, pp. 187, 190). 

C.4 With the notation of problem C.2, show that > di*dy is 
invariant under 7". t 


Appendix D 


Schur’s Lemma 


Given: Fyj()(T) and Iy‘*(T) are the matrices of two irreduc- 
ible representations I) and I’) (of dimensions n, and ny) 
of some group, and there exists a matrix P (of order 
MN, X n,) such that 

PPY(T) = PeT)P (D.1) 
for all group elements 7’. 


The lemma states that 
(i) if 7 and I are not equivalent, then P = 0; 
(ii) if 7 and I’) are equivalent, then P = 0 or det|P| 4 0; 


(iii) if Ty(7) = Fy(T), ie. all the matrices of P™ 
and I?) are equal and not just equivalent, then P = 0 or 
P =2E where J is a constant and £ is the unit matrix. 


Proof. Step 1. Choose any set of n, linearly independent vectors 

u, transforming according to I‘), i.e. 
Tuy = Fy(T ug, (D.2) 

and define a set of n, vectors 

y= Py. (D.3) 
Let us also assume that n, S$ n,. This involves no loss of generality, 
since if n, > nm, we can replace P, I(T), PT by the transposed 
matrices P, f'2(7), (7) respectively throughout the argument. 


Step 2. Now Tv; = TPiyuy from (D.3), 
= Pye'(T) Py from (D.2), 
= Pauly (T)ux from (D.1), 
= Py (1), from (D.3); (D.4) 


i.e. the vectors v; transform according to the irreducible representa- 
tion FP). . 
Step 3. Suppose that P #0. From (D.3) the vector space 
(...%-.-), t = 1 to m, has a dimension n’, S$ n, which is less than 
or equal to the dimension n, of the space (...u...), 1 = 1 to mg. 
Hence from step 2, we have inside the space (.. . u;. . .) an invariant 
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subspace (. . . %.. .) transforming according to [™., If mn’, were 
less than n,, this would make the space (...u,...) and hence the 
representation J“) reducible contrary to hypothesis. Hence n’, = ne. 

Step 4. Hence the space (. .. w;...) and the space (...%...) are 
the same (appendix C, Theorem 1). Since also n', Sm S ns, 
from step 3, n, = ”,, and the representations [’) and I?) are 
equivalent. This is still on the assumption that P 40. Thus if 
F™ and I are not equivalent, we must have P = 0, which proves 
part (i) of the lemma. 

Step 5. Now suppose that I and I?) are equivalent. Thus 
N, = M, and by step 3 if P + 0 the vectors 2; are linearly indepen- 
dent. Le. it is impossible to find a set of numbers ay (not all zero) 
such that «jv; = 0. 

Step 6. It is therefore also impossible to find numbers a (not ali 
zero) such that 

og Poy =0 for all i, (D.5) 


since (D.5) would imply ajv; = a;Pyum = 0 contrary to the result 
of step 5. Since the equations (D.5) have no solution except a; = 0 
for all j, we have |P| 40 (Margenau and Murphy 1943, p. 299), 
which proves part (ii). 

Step 7. Consider now Iy(T) = Fy’), and assume P + 0. 
From step 6, |P| + 0. 

Step 8. Hence we can find A #0 such that |P — AH| = 0, for 
the constant term in this equation in A is just [P| + 0. 

Step 9. Consider @Q = P — AE and the vectors wy = Oy = 
0; — Aus. From (D.2) and (D.4) we have Tw; = ly (T)w,. 

Step 10. Now the argument of steps 5 and 6 applied to Q and the 
vectors wy, gives that either |Q| 4 0 contrary to our definition of Q 
from step 8, or Q = 0, i.e. P = XE which proves part (iii) of the lemma. 

Corollary. Part (iii) of Schur’s lemma can also be stated thus: 


a matrix which commutes with every matrix of an irreducible 
representation of some group is a multiple of the unit matrix. 


PROBLEMS 


D.1 Trace the similarities and differences between the above 
proof of Schur’s lemma with those found in Van der Waerden 
1932, Wigner 1931, and effectively in Speiser 1937. 

D.2 Two sets of functions ¢; and ¢'s, + = 1 to n, span the same 
vector space. They also transform according to exactly the same 
irreducible representation I;(7') under a group of transformations 
T. Prove that ¢; = Ad’; for all ¢, where A is a constant. 


Appendix E 


Irreducible Representations of Abelian Groups 


Theorem. All irreducible representations of an Abelian group 
are one-dimensional. Consider any representation D(A), Dy(B), ... 
of the Abelian group 6(A, B,...). In our case all the matrices are 
unitary so that any one of them can be reduced to diagonal form by 
an equivalence transformation of the type (5.15) (see appendix C, 
lemma 2 and problem C.2; Van der Waerden 1932, p. 26; Margenau 
and Murphy 1943, p. 316). 

Step 1. Let P~-1D(A)P = D’'(A) be the diagonal matrix 


We bo kh. ethene 


Ma. (E.1) 


and put P-1D(B)P = D’(B). Since © is Abelian 
D’(A)D"(B) = D'(B)D"(A). (E.2) 


Now the left side of (E.2) is just the matrix D’(B) with each row 
multiplied by the corresponding Aj, of D’(A), and the right side is 
D’(B) with each column multiplied by the corresponding A» of 
D(A). Thus D'y(B) = 0 if the A»m’s in the ¢ and j rows of D’(A) 
are not equal, ie. if D’g(A) 4 D'j(A) (%, 7 not summed). Thus 
D’(B) has the form (E.3) where the blocks correspond to equal 
diagonal elements in D’(A). 

Step 2. D’(B) can now be reduced to diagonal form by a trans- 
formation P’-1D'(B)P’ with P’ also having the form (E.3). This 
does not alter the D’(A) because we are only making transformations 
between rows having the same A». Thus both D(A) and D(B) have 
been reduced simultaneously to diagonal form. 
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Similarly all the other matrices can be reduced. For consider 
D(T) and suppose that the preceding transformations have trans- 
formed it to D’(T). Then by step 1, Dy’(T) = 0 if rows i and j 
contain different diagonal elements in any of the matrices that have 
already been reduced to the diagonal form (E.1). If they do not, 
then by step 2, Dy"(T) can be reduced to zero by a transformation 


On-=-------0 


Seen 0 (E.3) 


of the type (5.15), without altering the matrices that have already 
been diagonalized. Hence all matrices can be simultaneously 
diagonalized. 

Thus any representation of an Abelian group can be reduced to 
a sum of one-dimensional representations, and these are therefore 
the only possible irreducible ones. 


Appendix F 


Momenta and Infinitesimal Transformations 


Let 91, I -- - gn be a complete set of co-ordinates describing 
a system, and let 7’ be the transformation to new co-ordinates 
Qn where 


% = 4+ 50, u=Qn, n =2,3,...N, 


and where in accordance with the convention of § 5 we replace the 
Qn by qn after the transformation. Then analogously to (8.1), 
(8.2) we can define the infinitesimal transformation I, by 


T =1 4 11,89, + 0(39,’). (F.1) 


We shall prove in this appendix that J, is related to the momentum 
operator p, which is conjugate to the co-ordinate g, in the usual 
sense (Schiff 1955, p. 133), in fact that we have (cf. equation 8.29) 


| p, = hl, | (F.2) 


Let S be any operator, some function, and let the suffix ¢ denote 
transformed quantities after applying 7. Then 


T(Sp) = Safe = Se TY. 


Comparing left and right members of this equation, we must have 
(cf. footnote to equation (5.4)) 


S, = TST-1. (F.3) 


The variable q, is an operator in quantum mechanics as well as a 
co-ordinate occurring in a wave function. Hence putting q, for 
S and using (F.1), we obtain from (F.3) 


g, + 8q, = (1 + 12,84,)q,(1 — 11,8q,) + 0(89,°), 
i.e. qt, — Tyg, = +. (F.4a) 


Similarly 
dal, — IyQn = 9, n=l; 
Poly —Lypn=0, all n. (F.4b) 
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Now (F.4) shows that AJ, has exactly the same commutation 
relations as p, (Schiff 1955, p. 135). Thus the difference 


Py — hl, =f (pn, Gn) (F.5) 


commutes with all p, and all gn. Since it commutes with pa, f 
cannot depend on gy which is the only fundamental variable not com- 
muting with py. Similarly f does not depend on py, and is therefore 
a constant. We now operate with (F.5) on the wave function % = con- 
stant. This represents a free particle at rest since it depends on 
no co-ordinates and hence cannot represent any motion or changing 
co-ordinates. Consequently pny = 0, and also J, —=0. Hence 
Jf = 0, and since f is a constant, it must be zero. The result (F.2) 
now follows. 

Another important result also follows from (F.3). The space 
inversion J7 (3.11) commutes with every rotation R(«, ¢), so that 
from (8.3) it commutes with every infinitesimal rotation J,. Thus 


(F.6) 


Les = ITI,[I-4 — IIT I, = L,, 


and J, is invariant under JJ. Thus from (F.2) all the angular momen- 
tum vectors are invariant under J]. As regards the orbital angular 
momentum L this fact is already obvious from (8.27), but it also 
applies to J (8.30) and the spin angular momentum operators 
introduced in § 11. 


Appendix G 


The Simple Harmonic Oscillator 
The Hamiltonian 
Consider the Hamiltonian 


1 


(G.1) 


where m is the mass of a particle, q its position co-ordinate, and p 
the conjugate momentum. If this Hamiltonian is treated classi- 
cally, it can easily be shown that the particle executes simple har- 
monic motion with angular frequency w. Quantum mechanically, 
the operators p and q have to satisfy the commutation relation 


qp — pq = th. 


Instead of using p and q it is more convenient to introduce the 


operator 


a = (2mhw)-¥*(p + imag), 


and its Hermitian conjugate operator a* defined by 


| (aryyrg do = | yrag do. 
Since :~ and q are both self-conjugate, we have 
a* = (2mhw)-/2(p — tmwg). 
It is also convenient to define the operator 


N = aa"*. 
We then have 


N = aa* = (2mhw)-\(p + imwg)(p — imag) 
= (2mhw)—"[p? + m?w*g? + tmw(gp — pq)] 
= (2mhiw)-p* + m2w®g? — mhw). 


Thus H = (N + d)hiw = (aa* + bw 
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(G.2a) 


(G.3) 


(G.2b) 


(G.4) 


(G.5) 
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and in the same way it can be verified that 
a*a — aa* =I, (G.6a) 


a*a® — ata* = na"-!, (G.6b) 


Eigenfunctions and eigenvalues 
Consider a function ¥, with the property 


From (G.4) we have Ny, = 0, so that # is an eigenfunction of N 
belonging to the eigenvalue n = 0. Consider now the function 


Yn = (n!)-/2a%d,, (G.8) 


where 7 is a positive integer. From (G.6) and (G.7) we have 
Nin = (n!)-¥7aa*amyp, 
= (n!)-'?a[na"—! + ara* hh, 
= Nin, - 
so that , is an eigenfunction of N belonging to the eigenvalue n. 


It now follows from (G.5) that the %_ are also eigenfunctions of #7 
belonging to the eigenvalues 


En = (n + $)hw, 


where 7 is a positive integer or zero. It can easily be verified that 
the %n’s are normalized and orthogonal (problems G.1 and G.2). 

Physically we say that the energy level E, contains n quanta 
of energy fiw, and (G.8) is described by saying that each factor of 
a “creates” an extra quantum out of the ground state y¥,. It is 
this property (and see also (G.11)) which gives a and a* their names 
of creation and destruction operators. N is the operator that counts 
the number of quanta. 


Completeness of eigenfunctions 

We shall now show, following Dirac (1958), that no further eigen- 
functions of NV and 3 exist. Let ¥y be a normalized eigenfunction 
of N belonging to some eigenvalue k, which for the present is quite 
unrestricted. Then 


| vetIye do = &; 
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and from (G.3) and (G.4) 


[vet Nyx do = | (a*yn)*(a* de) do 2 0. 


Comparing these, we conclude that 
k = 0, (G.9) 


7 


and if a*%, = 0, then k = 0. (G.10) 
We also have from (G.4) . 
N(a*dp’) = aar*a* yy! = a* Noy! — at, = (ko — late’. (G11) 


Hence a*y;’ belongs to the eigenvalue k’ — 1, and by repeated 
multiplication by a* we can produce eigenfunctions 4, = (a*) "aby! 
belonging to the eigenvalues k = k’, k’ — 1, k’ — 2, k’ —3,.... 
If k’ is not an integer, then k never has the value zero, and from 
(G.10) we never have a*y, = 0. Thus the series of eigenfunctions 
goes on for ever, and sooner or later we reach a negative eigen- 
value k, which is contrary to (G.9). We conclude therefore that 
k’ can only have positive integer values, in which case our sequence 
of eigenfunctions terminates with the lowest one % satisfying 
(G.10). 
Analogously to (G.11), we have 


Nay! = (k' + Mabe’, 


so that ay,’ is an eigenfunction belonging to the eigenvalue k’ + 1. 
Thus starting from %, we can, by repeated multiplication 
by a, produce one eigenfunction belonging to each eigenvalue n. 
We have therefore derived the series of eigenfunctions %, (G.8) 
instead of just conjuring them out of a hat. It now remains to show 
that each eigenvalue is non-degenerate, in which case it follows _ 
that the yp are all the eigenfunctions. We show this by noting that 
N is invariant under the transformations 


R(8):— a —> ea, a* > eq; 
(G.12) 


IT:— a->—a, a* +> —a*; 


where @ is any (real) angle. These transformations commute, and 
there do not appear to be any others, so that the complete symmetry 
group of N is Abelian. All its irreducible representations are one- 
dimensional (appendix E), and we conclude that the eigenvalues 
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of N are non-degenerate because there are no further symmetry 
properties that could produce any degeneracy. 


Matrix elements 
Any operator F can be expressed in terms of p and g, and hence 
as & power series in a and a*. With the help of the commutation 
relation (G.6), any term in the expansion can be written Srs(a*)'ae. 
The matrix elements of this can be calculated easily from (G.8). 
We have 
<m|(a*)ra®}n> 


= (m'n!)-¥ a (A™ho)*(a*)"asa%y dv 
= (m!n!)-1/2 | Yor(ar)m+rast ms. dy, (G.13) 
This transforms under (G.12) according to 


exp i(s + n — m — r)8, 


so that (G.13) is zero unless m — n = 8 — r (from equation (13.8¢c) 
of the fundamental theorem of § 13). From (G.6b), (G.13) becomes 
form—n=s~—r 


<m|(a*)fa*|n> = (min!)-V%Xs + n)! | sho bo aap: (s + n)! 


(mlnly¥/2 
(G.14) 


PROBLEMS 


G.1 Using (G.13), show that the gm and yin (m ~ n) are ortho- 
gonal. Also verify using (G.3) that the wave functions (G.8) are 
normalized if we assume yg is. 

G.2 Calculate the matrix elements of g. 

G3 Calculate fyn*g2s,dv and compare this with the mean 
value of g? for a classical oscillator with the same energy. 

G.4 Show that [¢m*q"ndv=0 unless m—n=r, r— 2, 
y—4,...,—7. 


Appendix H 


The Irreducible Representations of the Complete Lorentz 
Group 


In this appendix we shall construct systematically all the finite 
irreducible representations of the complete Lorentz group &, 
following the notation of § 31. 

Step 1. Consider a vector space which is invariant under £. We 
can reduce it according to the proper Lorentz group I, and let 
Umm' be a set of standard base vectors in it transforming according 
to Di, If j, 7’ are both integers or both half odd integers, D‘J/” is 
single-valued, and otherwise double-valued (problems 31.6 and 
31.9). 

Step 2. Now II7 commutes with every proper and improper 
Lorentz transformation so that by Schur’s lemma it is represented 
by a multiple a# of the unit matrix ZH. Also (J17)* is the identity 
transformation, so that a = 1 or —1 in single-valued representa- 
tions. In double-valued representations we have a = +1 as one 
possibility because of the double-valuedness, and a = +7 as another. 

Step 3. Now we have that 


+ L{v,z)r = L(—v,z), i.e. + exp(t@lzr) = exp(—i8lzr) 1, 
whence by expanding the exponential we obtain 
Lor = —7 Iz7. 
Also 7 commutes with a pure rotation and hence with Izy. Therefore 
equations (31.10), (31.12) imply 
Az(TUmm') = (— Hayt — Hert)Umm’ 
= (— Bay + $71 z7)Umm' 
= —TByimn! = —m'(rUmm’), 
BtUmm') = —M(TUmm’). 
Thus the rumm’ behave as base vectors U_m’ =m. By operating also 
with 4,, A_, B,, B_ it is easily shown that the vectors S7umm’ in 
fact transform like (31.12) according to DI’) under I, where 8 = 
(—1)f+d/—m—m" 
Step 4. We also have ITL(v, z)IJ = L(—v, z) and JZ commutes 
with every pure rotation. Step 3 therefore applies also to [Zumm’. 
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Now [lumm' = 7(I1t)tumm' = GtUmm’ where a is given by step 2, so 
that Tumm’ is proportional to rumm’. The vectors Umm!’ and tumm’ 
therefore span a vector space which is invariant under $. Also the 
matrix representing JT is always uniquely determined by those 
representing J7r and +. 

Step 5. If j =j’, the umm’ and rumm’ both transform under | 
according to D‘)) and may be related. In fact if we put 


Umm’ = Umm’ + Stu_m’—m; Vinm' = Umm’ — S7rU_m’—m; 
we have since D' is single-valued and 72tmm’ = Umm’; 
TU mm’ = 8U —m' —m: TV nm’ = —8 V_m!' —m- 


Thus the Umm’ span a vector space of dimension (27 + 1)? which is 
invariant under £, and so do the Vinm’. Coupled with the two 
possibilities of step 2, this gives four different representations of $. 
We denote them by Dir", r — 0, 1, 2, 3, and they are related to 
one another in the same way (31.17) as the D‘43:*) discussed there 
in detail. 

Step 6. Ifj Aj’ the umm’ and rumm’ transform under I according 
to different irreducible representations and are therefore orthogonal. 
Hence to obtain a vector space invariant under § we must take 
the 2(2j + 1)(2j’ + 1) vectors tm’, Tumm' together. Since 7? is 
the identity transformation, 7 is represented with respect to these 
base vectors by D(r) where 


0 #£ 
Aa FE ol 
for single-valued representations, and 


D(r) = + F | (Hla) or + = "4 (H.1b) 


for double-valued ones. Coupled with two alternatives of step 2, 
this gives two different representations D\j’+'ii"), ¢ — 0, 1 when 
single-valued, and four representations D‘I'+U'", r— 4, 5, 6, 7 
when double-valued. In the case of (H.1b) it is convenient to use 
tvUmm’ instead of tumm’ as base vectors, in which case D(r) 
becomes 
.|0 £ 

ilo 
The representations Dt/’+/’ir) are therefore related like the 
Dii0+08;7) by 

DBE T) = x¢r(a)DIN'49'S O(T), 
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where « = E, IJ, 7, Iv as T belongs to the branches J, JJI, zl, [Jzl, 
and the x,;(«) are given by Table 37. The single-valued ones are 
related to one another like D4"), r = 0, 1 (equation (31.17)): the 
representations corresponding to r = 2, 3 are equivalent to those 
with r = 1, 0 respectively when j #7’. In all cases the represen- 
tation with r = 0 or 4 is defined to be the one obtained from (31.15) 
where t,9, U_o, Uy, Uo transform according to D‘tr*, The 
single-valued ones can also be obtained by reducing 


DAD yx DUO x DAKO y LL, 


Step 7. We now show that all the above representations are in- 
equivalent. Clearly this is so for those belonging to different pairs of 
values of j, 7’ because they are inequivalent under proper Lorentz 
transformations. Suppose now for j = j’ 


DI (T) = PDG:8(T)P-1, (H.2) 


i.e. that the two representations r and s (for short) are equivalent. 
By construction r and s are identical for proper Lorentz transfor- 
mations L, and from (H.2) P commutes with the irreducible repre- 
sentation DWir)(L) = Dis L) of the proper Lorentz group. Hence 
by Schur’s lemma P = cH where c is a constant. Then from (H.2) 
r and s are completely identical. Suppose similarly for j + j’ 


DIED) = PDW'+1'58(T)P-, (H.3) 


The representations are again equal for proper Lorentz transforma- 
tions. However they become reducible 


D0W"+1'7)(L) —_ Ke aI 
2 


when we restrict ourselves to proper Lorentz transformations. 


Hence if we write 
Po oF 
P ee 1 | , 
LP 


Schur’s lemma gives P, = c,#, P, =c,H, P; = P,=0. By using 
this P and putting T = 7 and J/7 in (H.3), it can easily be verified 
that the constants c,, c, cannot be chosen such that (H.3) is satisfied 
if r ~.s. Thus all the representations described are inequivalent. 
Step 8. Given a finite vector space R which is irreducible under 
£, we can construct in this space one of the above representations 
by the procedure we have used. Using Az, B, we first pick out from 
R a vector wy’ with the highest eigenvalues m = j, m’ = j’. From 
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it using A_, B_ (31.12) we construct all the Umm’. Then from steps 
4 and 5 we obtain in a unique way a definite set of base vectors. 
Now in steps 2, 5, 6 we listed all possible alternatives, so that our 
base vectors must transform according to one of the representations 
Dit), r = 0,1, 2,3 or DWI, ry — 0, 1 (single-valued) or r = 4, 
5, 6, 7 (double-valued). Hence the irreducible space X transforms 
according to a representation which is equivalent to one of these. 
It follows that we have derived all the finite irreducible representa- 
tions of the complete Lorentz group. 


Appendix I 
Table of Wigner Coefficients (4j’mm'|JM) 


Note: Reading down one column in the table gives the coefficients 
in the expansion 


Wa) = > (ij’mm' [JMU V 99, (20.1) 


and reading across one row gives the coefficients in 


Um OV m9) <= > (ij’mm'|IM)W ue. (20.5) 
JM 


The tables are taken from Cohen (1949), who also gives a table for 
Di‘3/2) x Dts/) not included here. Algebraic tables have been 
given by the following: 

DY x DO), DO x D®, D® x D), Condon and Shortley (1951); 
D® x D'3/2), Cohen (1949); 

D® x D's/2), Melvin and Swamy (1957); 

D® x D®), Falkoff et al. (1952). 


Extensive numerical tables in decimal form have been published 
by Rose (1957) and Simon (1954). 


D, X Dy, 


142 1/2 
Oni he Vite 


+ The tables are reproduced here through the kind permission of Dr. Cohen. 
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D; X D, 
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i es ee , 


8/2 71 

Osis Viiis 4 2 
3/2 1/2 

Urj2 Vials Vi —vV/4 
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Appendix J 


Notation for the Thirty-two Crystal Point-groups 


International Full symmetry Schoenflies 
symbol symbol symbol 
1 1 Cc: 
I I S,(C%) 
2 2 Cz 
m m Cin(Cs) 
2/m a Con 
2mm 2mm Cav 
222 222 DV) 
mmm 2 ka = Den(V nr) 
4 4 C, 
4 4 Sy 
4/m “ Can 
4mm 4mm Cay 
4 2m 4 2m Deal Va) 
422 422 Ds 
422 
4/mmm species Dan 
3 3 C3 
3 3 Se(Cst) 
3m 3m Czy 
3m 3 = Dea 
32 32 D; 
6 6 Ce 
6 6 Csr 
6/m e Cen 
m 


International 


symbol 
6 mm 
6 m 2 
622 
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Full symmetry 


symbol 


Appendix K 
Character Tables for the Crystal Point-groups 


Notation 


The point-groups are designated by their international symbols 
which are explained in § 16. The corresponding Schoenflies symbols 
may be found in appendix J. 

The classes of group elements are indicated by one typical element 
from each class. These elements are given by using an extension 
of the international notation for the groups. The symbol n now 
stands for a rotation by 360/n degrees and n* for 360 r/n degrees. 
The rotation axis is given by a suffix, e.g. x, y, z, and d meaning an 
axis at 45° to Ox and Oy. The symbol m, thus means a reflection 
in a plane perpendicular to the z-axis. 7 is an improper rotation by 
360/n°, ie. a proper rotation by this angle followed by inversion 
through the origin. The inversion JJ (3.11) itself is thus denoted 
by 1. 

The irreducible representations are labelled systematically as 
follows. A and B always denote one-dimensional representations, 
B being used if a rotation by 360/n° about the principal axis (chosen 
as z-axis) has the character —1. EF is used for a two-dimensional 
representation, and 7’ for a three-dimensional one. A pair of 
complex conjugate one-dimensional representations are always 
bracketed together and regarded as a two-dimensional representa- 
tion H, because for most purposes they behave as such due to time- 
reversal symmetry (§§ 19 and 23). If there are two representations 
in which the characters of mz differ in sign, they are distinguished 
by ‘ and ”. Subscripts g and u (German, gerade and ungerade) 
refer to positive and negative characters of the inversion JJ (or 
I in the above notation). When this system allows several different 
labellings, the use of u and g takes preference over ' and ”, which 
in turn takes preference over suffices 1, 2, etc. We have followed 
this scheme consistently in this appendix, though this has meant 
introducing minor differences from the notation of other authors. 

Some point-groups are direct products g x 1 (§ 15), where g is a 
proper point-group and I the inversion group (EZ, JT). Their character 
tables are not given explicitly but may easily be constructed as 
follows. Each representation D of g gives rise to two representations 
Dg and Dy of the group g x 1. In these the proper rotations R 
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have the same characters as in D, but the improper rotations I7R 
have the characters 
x(TR) = x(R) in Dg, 
= —y(R) in Dy. 


The classes are as follows. Suppose R,, R,, .. . form a class of g. 
Then the proper rotations R,, R,, ... of the group g x I again 
form a class, and the corresponding improper rotations JTR,, 
ITR,, . . . form another class. Thus g x I has twice as many classes 
as g, which checks with the fact that it also has twice as many 
representations (cf. equation (14.16)). The above construction for 
the character table of g x I follows deductively from the theory 
of § 15, or can be justified @ posteriori by noting that the character 
table satisfies all the orthogonality requirements, etc., of § 14. 

On the right side of each character table are listed the quantities 
XL, Y, 2, x, y*, 24, xy, yz, zu, Iz, Iy, Iz in the row of the representation 
to which they belong. Here Iz, I, Iz are the infinitesimal rotation 
operators which transform as a pseudovector. The chief use of this 
is in connection with the vibrational modes of molecules as sum- 
marized in Table 23. In the case of groups g xX I whose character 
tables are not given in full, the quantities x, y, z always transform 
according to the appropriate wngerade representations, Dy, and 2%, 
y*, 2", xy, yz, 2x, Iz, Iy, Iz according to gerade representations Dg. 


Triclinic, monoclinic and orthorhombic point-groups 


ij # I 


Ag 1 1 x; y?; 22; wy; yz; 2x3 
Iz; ys £2. 
Ay 1-1 XL; Y; 2. 


A LoL | 2 y?; 24; 2; ay; I. 
B 1 —1 | 2; y; 22; yz; Iz; y. 


A’ 1 1 | 2&3 y; 2; y?; 2; ay; Iz. 
A” 1 —] 2; yz; xz; Iz; Ty. 
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922 | BH 2, Q 2% 


A 1 1 LoL | a; y?; 22. 
B, 1-1-1 1 | 2; ay; JZ, 
B, 1-—l 1-1 | y; a2; I, 
B; 1 1-1 —1 | 2; yz; Iz 


Here A, B,, B,, Bs are used for 222 because any one of the two- 
fold axes can be considered the principal one. 


mmm = 222 x I 


2mm E 2; My Mz 


A, I eke ae Sb Spat ts e?. 
A, 1 1 —1 —1 Ka] Bs q,. 

B, 1—1 1 —1 | a; xz; Ly. 
B, 1-1 —1 1 | y; yz; Ix. 


1 1 x; y; 22: wy: Te. 


Ag 1 1 

Bg 1 —1 —1 1 | ye;a2; Iz; Ty. 
Ay 1 1-1 -—1l Zz: 
By 1-1 1-—l x3 Y; 


Tetragonal point-groups 


au + y?; 2%; Iz. 
x? —y?; xy. 

®, Y; U2, YZ; 
ve Ty. 
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A, Lolo Ll 1 1 | a + y?; 22, 

A, I 1 1-1-1 [ea d,. 

B, 1 L~tl 1-1 | 2 — y?, 

B, | 1 1-1-1 1 fay. 

EB 2-2 0 0 O | a, y; 22, yz; Iz, ry. 


A, LoL Ll 2 od | a; 2 4 y?; 22, 

A, 1 1 1-1-1 | Xk. 

B, 1 12-l 1-1 | a? — y?, 

B, 1 1~1~—1 1 | zy. 

E 2-2 0 0 O | a2, y; xz, yz; Iz, Ty. 


A, 1 1 1 1 1 | a + y2; 22 

Az {1 1 1-1-1 [&. 

B, 1 1-—i 1-1 | at—y? 

B, 1 1-1 ~—1 1 | 2; ay. 

E 2-2 0 0 O | &, y; xz, yz; In, Iy. 
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Trigonal and hexagonal point-groups 


2; a? + y*; 23; Tz. 
x,y; x* — *, wy; 


3=3x i 


a + y?; 2°. 
23 Li. 
x, y; x? — y?, xy; az, yz; Iz, Ly. 


The representations A,, A,, H of this group are the same as J, 
&, I of Tables 3 and 8. 


3m = 32 x I 
3m. 
A, 2; 22? + y*; 22, 
A, rAd 
E x, y; u* — y*; ay; xz, yz; Iz, Ty. 


6 E 62 3z 2: 32” 625 


1 1 1 | 2; 2 + y?; 2%; Tz. 


A ] 1 l 
B 1 —1l 1 —l 1 —l 
E 1 —w? w —1 w? —w L,Y; LZ, Y2; 
1 1 —-w ow? —1 wow —w*|\ Iz, Ly. 
1 w w? 1 7) w2 
Hy { 1 we w 1 wt w hat — ¥?, wy 


where w = exp(27t/3). 
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6/m = 6 x I 


au? +. y?; 28; Ip. 
2. 


ha, y; x2 — y*, ay. 


xz, yz; Iz, Ty. 
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Cubic point-groups 


E22 3 = 3 
1 1] 1 od | w+y?%?+4+22 = 72, 
2 
: : if . has — y*, 32% — r?, 
3-1 0 O | 4, y, 2; xy, yz, zu; Ig, Ty, Iz. 


at 4 y? 4 2% = 72, 


x, Y, 2; Ix, Ty, Iz. 


1 
I 
O | x® — y*, 322 — r?. 
1 
1 | xy, yz, 2x. 


m3m = 432 x 1 


E 3 2. Mg 4, 


Appendix L 


Character Tables for the Axial Rotation Group and Derived 
Groups 


Notation 

The notation follows that of appendix K as closely as possible. 
The groups com and oo/mm are denoted by C.., and D,,, in the 
Schoenflies notation. 


co | £  R(¢, z) 
A 1 1 z; 29; 2% + y?: Te. 
E { 1 exp(t¢) L,Y; U2, Y2; 
: 1 exp(—t¢) | Iz, Ly. 
1 exp(i2¢) 
Bat] 1 cxpt—aap [fet ~ 9 a 
Bm{| 1 Sal imd) 


1 1 1 | 2; 235 vw? + y?, 
1 1 —-1 | dz. 
E, =TII 2 2 cos d O | x, y; x2, yz; Iz, Ty. 
=A 2 2cos26 O | a? — y?, ay. 


eee eee ete e fF ee ij#*~seeenve 
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co/mm E R(¢,z) 22 1 IR(¢,z) mz 


1 1 1 1 1 Ll | 235 4% + y?, 
1 l 1-—l -—l —1 
1 1 -1 1 1 —] 
1 1 —-1-—-1 -—-Il 1 
Eig=TIlg | 2 2c0o8¢ O 2 2cosP O | az, yz Iz; Ty. 
2 2cosed 0-2 —2cosd 0 
2 2cos24 0 2 2 cos 2¢ 0 
2 2cos2¢ 0 —2 —2cos2¢ 0 
2cosmp 0 2 2 cos mp 0 
2cosmd 0 —2 —2cosmd 0 


coo/m = 0 x I 
oo 2 is isomorphic with com. 


GENERAL REFERENCES 
Group theory applied to quantum mechanics 


B. L. Van Dern Waxrpen; Die Gruppentheoretische Methode in der Quanten- 
mechanik, 1932. After 4 summary of the relevant parts of quantum mech- 
anics in Chapter I, Chapter II develops in 32 pages all the necessary general 
theory of vector spaces and group representation from first principles. The 
whole treatment is extremely concise, neat and elegant, yet without being 
more general and abstract than is required for the purpose at hand. The 
style is terse, occasionally to the point of making the argument difficult to 
follow on a first reading. The properties of group characters are developed 
but not used. In succeeding chapters the irreducible representations of the 
rotation group and the Lorentz group are derived, and then applied to the 
theory of atomic spectra. The final chapter deals with the applications to 
diatomic molecules. It appears that at least three different unpublished 
translations of this book are in existence for purpose of private study. 

E. WIGNER; Gruppentheorie und ihre Anwendung auf die Quantenmechanik 
der Atmospektren (translated Group Theory and Its Application to the Quantum 
Mechanics of Atomic Spectra), 1931. Although the subject matter is very 
similar to that of the previous book, the treatment is somewhat different. 
The representation theory is developed from the point of view of the group 
characters, even for continuous groups. In the discussion of the rotation 
group this makes for rather more algebraic manipulation. The exclusion 
principle is taken into account via a detailed discussion of the representations 
of the permutation group (as outlined in § 28), because it had not been realized 
at the time that this could be avoided. Altogether the book is a mine of infor- 
mation on the detailed manipulation of the rotation and permutation groups, 
but this has the consequence that the wood is often lost for the trees and the 
present author finds the book rather unsuitable as a first introduction to the 
subject of Group Theory in Quantum Mechanics in general.f 

H. Wey1; Gruppentheorie und Quantenmechanik (translated The Theory 
of Groups and Quantum Mechanics), 1931. This book is considerably more 
difficult reading than either of the previous two. However, after having mas- 
tered it, many people feel that they have a much deeper and more satisfying 
understanding of the whole subject than they had before, for in this book 
the actual structure of the quantum theory is developed from a group- 
theoretical point of view. As regards specific applications, the rotation, 
Lorentz and permutation groups are discussed in detail and applied to atomic 
spectra and to Dirac’s relativistic theory of photons and electrons. 

H. Eyring, J. Waiter and G. E. Krmsai; Quantum Chemistry, 1944. 
From the present point of view, the importance of this book lies in the fact 
that it is the first and best known standard general text book which has 
dared to assume that elementary group theory is, or should be, part of a 
modern student’s equipment just as much as being able to write down the 


t Note added in proof; An English translation in a revised and enlarged edition has recently 
been published (E. P. Wigner, 1959. Group Theory and Its Application to the Quantum Mechanics of 
Atomic Spectra, Academic Press, New York and London). Three new chapters have been added: one 
on Racah Coefficients; one on Time Reversal Symmetry; and one on the Physical Interpretation 
and Classical Limits of Wigner and Racah Coefficients. 
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Schrédinger equation. The elements of group theory are given in very con- 
densed form in one chapter near the middle of the book, very much from the 
point of view of getting some useful algebraic relations about group charac- 
ters for use as a tool later on in the calculation of molecular energy levels 
and vibrations. The more general group-theoretical attitude towards degen- 
eracy and selection rules is not stressed. 

L. D. Lanpav and E, M. Lirsurrz; Quantum Mechanics, Non-Relativistic 
Theory, 1958. This general quantum mechanics text also includes a chapter, 
which is very clearly written, on group-theoretical symmetry arguments. 

C. Ecxart; The Application of Group Theory to the Quantum Dynamics of 
Monatomic Systems, Rev. Mod. Phys. 2, 304, 1930. This review article of 75 
pages develops the theory from first principles, together with the simpler and 
more important applications to atomic spectra. 

H. Marcenav and G. M. Munruy; The Mathematics of Physics and Chemis- 
try, 1943. This volume includes a chapter on group theory, which contains 
some useful information about groups and their irreducible representations. 
The discussion of its application to quantum mechanics is so brief as to be of 
not much help to the reader. 

S. Buacavantam and T. VENKATARAYUDU; Theory of Groups and Its 
Application to Physical Problems, 1948. The book includes an introduction 
to group theory and its use in physics, including quantum mechanics. The 
principal application is to vibrations in molecules and solids. The discussion 
of solids is incorrect in parts (at least it seems so to the present author). 


Mathematical theory of group representations 


A Sprismr; Theorie der Gruppen von Endlicher Ordnung, 1937. From the 
point of view of the student of quantum mechanics the importance of this 
book lies primarily in its discussion of group representations, which forms a 
very satisfactory alternative reference to Van der Waerden (1932) as an 
elementary but systematic and rigorous presentation of the basic theory. 
The emphasis is on finite groups, group characters and their orthogonality 
relations, so that the treatment appeals particularly to those who have in 
mind the applications to molecular problems. Chapter 6 with its beautiful 
illustrations of two-dimensional space-groups is also noteworthy. 

G. Brexnorr and S. MacLane; A Survey of Modern Algebra, 1941. Chapters 
6 to 9 of this standard text book give an introductory account of the algebra 
of groups, vector spaces, matrices and linear transformations. Although 
not all the aspects required in quantum mechanical applications are included, 
the treatment is particularly noteworthy for its simple style and for the 
many examples which illustrate the meaning of the various terms as they are 
introduced. 

W. Leperman; Introduction to the Theory of Finite Groups, 1953. This 
gives a good introduction to groups, but does not mention group representations, 

F. D. Munnacuan; The Theory of Group Representations, 1938. A rather 
advanced account of the theory of group representations is given, with 
particular reference to the permutation, rotation, Lorentz and crystallographic 
point-groups. 

H. Borrner; Darstellungen von Gruppen mit Bericksichtigung der Bedirf- 
nisse der Modernen Physik, 1955. The mathematical theory is presented of 
the representations of the permutation group and of continuous groups of 
non-singular linear transformations in »-dimensions {including the unitary, 
rotation, Lorentz but not simplectic groups). 
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INDEX 


A 


Abelian group, 14, 49, 124, 420. 

Accidental degeneracy, 44, 222, 278, 
301. 

Ammonia, 241. 

Angular distributions, 339. 

Angular momentum, 62—4, 80: coup- 
ling of, 67—72, 176-89. 

Anharmonicity, 256. 

Antisymmetric representation, 51, 90. 

Axial rotation group, 15, 55, 67, 
218: representations, 49, 136, 455-6. 


B 


Band structure, 271-303: 
together of bands, 285-93. 

Base vectors, 29: ‘“‘ standard,” 58, 
66, 354. 

Basis, for representation, 28. 

Benzene, 213, 227, 244. 

Beta decay, 164, 384-396. 

Bloch function (orbital), 216, 266, 281. 

Body centred cubic structure, 265, 
268, 280. 

Brillouin zone, 267: special points, 
278: extended zone scheme 282: 
reduced zone scheme, 268. 


sticking 


Cc 


Carbon, 210. 

Carbon dioxide, 241, 263. 

Central self-consistent field Hamil- 
tonian, 75, 

Cerium ethylsulphate, 149. 

Characters, 114-124: for character 
tables of particular groups, see 
under names of groups, particularly 
Point-group. 

Charge conjugation, 376, 384, 397. 

Charge independence of nuclear forces 
321-333. 

Chromous sulphate, 155. 

Classes, 114, 122, 124-5. 

Clebsch-Gordon, see Wigner coeffi- 
cients. 


. Electromagnetic field, 349: 


Combination tone, 252. 

Commutation, 14, 146, 419. 

Commutation relations, 54, 352-4, 
367, 368, 423. 

Compatibility relations, 277. 

Complex conjugate representations, 
171, 235, 448. 

Configurational interaction, 77, 206. 

Configurations, atomic, 75, 91, 92, 
98: of the elements, 93: molecular, 
221: nuclear, 327. 

Constants of the motion, 144-7, 337, 
348. 

Continuous groups, 113, 125, 144. 

Covalent bond, 208-21. 

Crystal field splittings, 47, 
148-163. 

Cubic group, see point-group. 

Cyclic group, 48. 


112, 


D 

DM, 58, 

Degeneracy, 2, 29, 42: accidental, 
44, 222, 278, 301: splitting by 
perturbations, 44: due to time 
reversal, 171-3, 291, 448: in atoms, 
75-7, 84-5, 90-3: in molecular 
vibrations, 235, 258-61, 263: in 
solids, 278-9, 285-8, 291-303: due 
to isotopic spin, 325. 

Determinantal wave functions, 91, 
206. 

Deuteron, 328-30, 333, 337-8. 

Dimension of vector space, 26, 30, 412. 

Dirac equation, 366-383. 

Dirac method, 143-7. 

Direct inspection method, 310. 

Double suffix notation, 406. 

Double-valued (spin) representations, 
59-60, 137-41, 226, 293, 359-61, 
363. 


E 


Elastic constants, 305-12. 
vector 
potential, 383. 
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INDEX 


Element of a group, 14. 

Energy levels, crossing, 
also Degeneracy. 

Equivalence, of representations, 31-3, 
115, 119, 123: transformation, 32. 

Equivalent orbitals, 94. 

Eulerian angles, 60. 

Exchange, 91. 

Exclusion principle, electrons, 89-99, 
376, 403: nucleons, 315-21: posi- 
tronium, 3989, 403. 


222: see 


F 


Face centred cubic structure, 265» 
268, 280, 281. 

Favoured transitions, 388. 

Finite groups, 113-25, 145: see also 
Point-groups. 

Forbidden transitions, 386, 394-5. 

Free electron model, 282. 

Full rotation and reflection group, 
15, 125-8, 

Full rotation group, 15, 51-72: 
irreducible representations, 55-61: 
characters, 117: reduction of pro- 
duct representation, 67-72. 

Fundamental theorems, of group 
theory, 412-19: on degeneracy of 
energy levels, 41-7: on matrix 
elements, 100-3, 120: on reactions, 
335-7, 348: on tensor components, 
309. 


G 


g-factor, 111, 153, 160-2. 

Gamma rays, 349. 

Gamma matrices, 367-87: definition, 
367-8: transformation properties, 
369: uniqueness, 370-2. 

General references, 457-8. 

Germanium, 296. 

Glide planes, 285, 285-302. 

Group, definition, 14: for specific 
groups and types of group, see 
under their respective name, e.g. 
Point-group. 

Group of k, 275, 299. 


H 


Hamiltonian, of free atom, 73: 
symmetry transformations of, 6-11, 
19. 
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Hartree’s equations, 74. 

Hartree-Fock equations, 91, 96. 

Heitler-London wave function, 218. 

Homomorphism, 410. 

Hund’s rule, 97. 

Hydrogen molecule, 10, 206-8, 216-8, 
228, 

Hyperfine structure, 189-205: s-elec- 
tron effect, 198. 


I 


“Identical ’? representations, 33. 
Identity representation, 26, 103, 120. 
Improper rotations, 9, 129, 131, 238. 
Indium antimonide, 296. 
Infinitesimal rotation operators, 52— 
72, 106, 351-5: definition, 52: 
commutation relations, 54-5, 352-— 
4: relation to angular momentum, 
62-4: relation to spin, 80, 375. 

Infinitesimal transformations, 422-3. 

Infra-red spectra, 239-41, 245-64. 

Inversion (in space), 9, 86, 356: 
group, 41: centre in point-groups, 
448: centre in solids, 295: in polar 
coordinates, 10: see also Parity and 
Time inversion. 

Irreducible representations, defini- 
tion, 34: in vector space, 35: 
specification by characters, 115: 
number of, 122: test for, 124: of 
specific groups, see name of group 
or type of group, e.g. Point-group. 

Isomorphism, 20-2, 410-11. 

Isotopic spin, formalism, 313-21: 
and nuclear forces, 322-34: and 
reactions, 337-9, 346-9. 


J 
Jahn-Teller effect, 175. 
jj-coupling, 86. 

K 
Kramers’ theorem, 166, 169-70. 


L 


Landé splitting factor, 111. 
Level, of a term, 85: see also Degener- 
acy and Energy level. 
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Lorentz group, 351-63, 428: trans- 
formation under, 365-70, 379-81, 
385. 


M 


Magnetic field, Hamiltonian, 153, 
374; Zeeman effect, 111: Zeeman 

splitting in crystals, 148-62: effect 
of time reversal, 170, 383. 

Magnetic moment, of electron, 78, 
375: of nucleus, 190, 192. 

Matrix algebra, 404. 

Matrix elements, 2, 99-112: Funda- 
mental theorem on selection rules, 
100-3, 120: with Wigner and 
Racah coefficients, 183-6. 

Mesons, 331-2, 334, 338, 346. 

Molecular, wave functions, 206-228: 
vibrations, 229-44, 245-64. 

Multicomponent functions, 81-3, 88, 
365-6. 


N 


Negative energy states, 373, 376. 

Neutrino, 385, 391: two component, 
396. 

Normal coordinates, 232-3, 239. 

Nuclear, forces, 321-33: spin, 190: 
moments, 192. 


oO 


Octahedral complexes, 227. 

Operator equivalents, 153, 163. 

Orbital angular momentum, 80: 
** quenched,” 159. 

Orbitals, atomic, 74: in molecules, 
206-28. 

Orthogonality relations, 117~—8, 123. 

Overtones, 252. 

Oxygen molecule, 220, 229. 

Ozone, 16, 233-40, 248-52, 259-60, 
262: see also ‘‘ 32.” 


P 


Paramagnetic ions, 148-163. 

Parity, 86: of spin functions, 87, 361, 
398: of spherical harmonics, 87: 
atomic, 87, 104: nuclear, 190, 328, 
341-3: of fundamental particles, 
331, 334, 360: non-conservation of, 
386, 390, 395. See also Inversion. 


Paschen-Back effect, 111. 

Periodic boundary conditions, 266. 

Periodic table, 92. 

Permutation (symmetric) group, 9, 
15: of order two, 41: of order three, 
21: of order n, representations, 50: 
see also Exclusion principle. 

Perturbation, splitting of levels, 44-6. 

Perturbation theory, degenerate, 106: 
spin-Hamiltonian form, 160, 223: 
for band structures, 283. 

Piezoelectric constants, 312. 

Point-groups, 15, 128-142: list of, 
130, 131, 135, 446: international 
and Shoenflies notation, 129-31, 
446: character tables, 136-7, 448- 
54, 455-6: double-valued repre- 
sentations, 137-41: point group 
three-two, 13-5, 27 (irred. reps.), 
115 (characters). 

Positron, 376: positronium, 397-403. 

Product groups, 125-8. 

Product representation, 67, 
symmetrical, 260, 308. 

Projection operators, 119. 

Pseudo-vector, 89, 358, 383. 


116: 


Q 


Quadrupole moment, 201, 329. 
Quantum numbers, 75-6, 84. 
Quenching of orbital moment, 159. 


R 


Racah coefficients, 183-8, 202. 

Raman spectra, 239-41, 248-64: 
in solids, 284. 

Reactions, nuclear, 334-50: funda- 
mental theorem on, 335-7, 348: 
selection rules, 337-8, 341-3: angu- 
lar distribution of products 339—46: 
relative cross sections, 338-9, 
346-8. 

Reciprocal lattice, 267, 280. 

Reduced wave vector, 268, 

Reducibility, 33-7, 416. 

Reduction of a representation, 67, 
115. 

Refiection, 9. 

Representation(s), 25-9: reducible, 
irreducible, 33-7, 416: reduction 


INDEX 


of, 67, 115: equivalence, 31-3: 
‘identical,’ 33: identity repre- 
sentation, 26, 103, 120: complex 
conjugate, 171, 235: unfaithful, 
26: double-valued (spin), 59, 137, 
226, 293, 359, 363: regular, 121: 
of particular groups, see group or 
type of group, e.g. Point-groups. 

Relativistic, wave equation, 363-6: 
transitions, 387, 395. 

Rotations, 3-7, 11, 52-4: notation, 
4: composition of, 65: Eulerian 


angles, 60: see also Improper 
rotations, Infinitesimal rotation 
operators, Full rotation group, 


Axial rotation group, Point-groups. 
Russell-Saunders coupling, 85, 97. 


s 


Schur’s lemma, 101, 118, 415, 418. 

Screw axes, 285-7, 296, 299-301. 

Selection rules, Fundamental theo- 
rem for, 99, 102-3, 120: electric 
dipole transitions, 103: magnetic 
dipole, 110: molecular spectra, 240, 
247, 249: beta decay, 386-8, 395: 
gamma emission, 349, 350: reac- 
tions, 337-9: positronium, 401. 

Self-consistent field, 74, 91. 

Shell model, 326. 

Simple harmonic oscillator, 6, 233, 
252-4, 256-61, 264, 424-7. 

Space-groups, 128, 265, 284, 265-303: 
summary of irreducible representa- 
tions, 273-6, 299-300: spin rep- 
resentations, 293. 

Spectroscopic notation, 75, 218. 

Spherical harmonics, 58-9: parity, 
87: characters, 117: matrix ele- 
ments, 149-53. 

Spin, electron, 78-9, 365-6, 375: 
nuclear, 190, 330: other particles, 
331, 365: functions, 82-3 (defini- 
tion), 60, 83 (transformation), 87, 
360, 361, 398 (parity), 167 (time- 
reversal): operators, 79, 88: mat- 
rices, 88. 

Spin—, -Hamiltonian, 156-64, 223: 
-orbit coupling, 81, 107, 226, 293, 
375, 397 (electrons), 329, 334 
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(nuclei): -spin coupling, 81: -rep- 
resentations (double-valued), 59— 
60, 137-41, 226, 293, 359-61, 363, 

Spinor, 66, 83, 177, 355: fields, 360, 
365: invariants, 177, 

Splitting of levels, 1, 44-7: see 
also Degeneracy and Perturbation 
theory. . 

Square lattice, 268-283. 

Star of k, 274, 299. 

Stereogram, 129-30. 

Sub-group, 20. 

Sum rule, 109. 

Symmetric group, see Permutation 
group. 

Symmetrical product, 260, 308. 

Symmetry, consequences of, 1-3: 
transformations, 3-6: of Hamil- 
tonian, 6-12, 16-20. 


r 


Tensors, 70, 72, 184, 304~12. 

Terms (energy levels), atomic, 83-6, 
93-7, 107-110: molecular, 221, 225, | 
229: positronium, 401. 

32 (point-group “‘ three-two ”’), 13-5: 


irreducible representations, 27: 
characters, 115: see also Point- 
groups. 


Time-inversion, 164, 356, 428. See 
also Time-reversal. 

Time-reversal, 164—75: and complex 
representations, 235, 448: in band 
structures, 290, 295: symmetry of 
tensors, 312: Dirac equation, 383: 
beta decay, 396. See also Time- 
inversion. 

Transformations, 3-6, 12, 24, 30: of 
Hamiltonian, 6-12, 16-9: of wave 
functions, 8, 24, 41: of operators, 
25 footnote: of multi-component 
functions, 81-3, 88, 365-6: equi- 
valence, 32: unitary, 409, 414: 
relativistic, 379. 

Transitions, see Selection rules. 

Translation group, 265. 


U 
Unfavoured transitions, 388. 
Unitary, matrix, 408: transforma- 
tions, 409, 414. 


468 


Vv 


Valence, bond orbital, 208, 213. 

Vector coupling, see Wigner coeffi- 
cient. 

Vector potential, 383. 

Vector space, definition, 29: reduc- 
ibility, 35: mathematical theory, 
412-7: application to quantum 
mechanics, 41. 

Vectors in ordinary space, 59, 70, 
358: pseudo-, 89, 358, 383. 

Vibrations, of molecules, 
245-64: in solids, 284. 


229-44, 
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WwW 


Wave functions, transformation of, 
8, 24, 41: multicomponent, 81-3, 
88, 365-6. 

Wave vector, 
282. 

Wigner coefficients, 
tables of, 432-45. 


266: reduced, 268, 


70-2, 176-89: 


XYZ 


Zeeman effect, 111: in crystals, 


148-62. 
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